SSB-polymerase fusion proteins

ABSTRACT

Fusion proteins comprising a single strand DNA binding protein and a nucleic acid polymerase (e.g. DNA polymerase or reverse transcriptase). These high fidelity proteins are suitable for use in nucleic acid amplification methods, including the polymerase chain reaction (PCR).

CROSS REFERENCE TO RELATED APPLICATIONS

The present application is a continuation of U.S. patent application Ser. No. 11/222,029 filed Sep. 9, 2005 now abandoned, the contents of which are entirely incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention provides fusion proteins comprising a single strand DNA binding protein (SSB) and a nucleic acid polymerase, and methods for using such fusion proteins in nucleic acid synthesis reactions.

2. Background of the Invention

DNA polymerases synthesize DNA molecules that are complementary to all or a portion of a nucleic acid template, typically a DNA template. Upon hybridization of a primer to a DNA template to form a primed template, DNA polymerases can add nucleotides to the 3′ hydroxyl end sequence of nucleotides of the primer in a template-directed (i.e. depending upon the sequence of nucleotides in the template). Thus, in the presence of deoxyribonucleoside triphosphates (dNTPs) and a primer, a new DNA molecule, complementary to all or a portion of one or more nucleic acid templates, can be synthesized.

DNA polymerases that exhibit increased yield and/or enhanced fidelity (i.e., more accurate template-directed polymerization) are useful in nucleic acid synthesis, amplification and sequencing reaction. The compositions and methods described herein provide such DNA polymerases.

SUMMARY OF THE INVENTION

The present invention provides an isolated or purified single stranded binding protein (SSB)-nucleic acid polymerase fusion protein. In one embodiment, the nucleic acid polymerase is a DNA polymerase. In another embodiment, the nucleic acid polymerse is a reverse transcriptase. The nucleic acid polymerase may be thermostable. In one embodiment, the thermostable DNA polymerase is Thermococcus zilligi (Tzi) DNA polymerase. In another embodiment, the SSB is thermostable. In one aspect of this embodiment, the SSB is Sulfolobus solfataricus (Sso) SSB.

The present invention also provides a nucleic acid molecule encoding a SSB-nucleic acid polymerase fusion protein. In one embodiment, the nucleic acid polymerase is a DNA polymerase. In another embodiment, the nucleic acid polymerse is a reverse transcriptase. The nucleic acid polymerase may be thermostable. In one embodiment, the thermostable DNA polymerase is Thermococcus zilligi (Tzi) DNA polymerase. In another embodiment, the SSB is thermostable. In one aspect of this embodiment, the SSB is Sulfolobus solfataricus (Sso) SSB.

An expression vector is also provided which comprises the nucleic acid molecule described above operably linked to a heterologous promoter. The present invention also provides a host cell comprising this expression vector.

The present invention also provides a nucleic acid molecule encoding a Sulfolobus solfataricus (Sso) SSB-Thermococcus zilligi (Tzi) DNA polymerase fusion protein.

Another embodiment of the invention is a method of increasing the yield of a polymerase reaction on a target nucleic acid comprising contacting the target nucleic acid with a primer which specifically hybridizes thereto, and an isolated or purified SSB-nucleic acid polymerase fusion protein; and incubating the resulting mixture under conditions whereby said primer is extended by said polymerase. In one embodiment, the nucleic acid polymerase is a DNA polymerase. In another embodiment, the nucleic acid polymerse is a reverse transcriptase. The nucleic acid polymerase may be thermostable. In one embodiment, the thermostable DNA polymerase is Thermococcus zilligi (Tzi) DNA polymerase. In another embodiment, the SSB is thermostable. In one aspect of this embodiment, the SSB is Sulfolobus solfataricus (Sso) SSB.

The present invention also provides an isolated or purified Sulfolobus solfataricus (Sso) SSB-Thermococcus zilligi (Tzi) polymerase fusion protein.

BRIEF DESCRIPTION OF THE DRAWING

FIG. 1 is a photograph of an agarose gel showing a comparison of Sso SSB-Tzi polymerase fusion protein to PfuUltra (Stratagene), Phusion (Finnzyme) and iProof (Bio-Rad) in PCR target amplification. PCR was performed with 100 ng K562 cell DNA template. Targets tested were A) Rhod 462 bp, B) p53 1494 bp, C) Rhod 2497 bp, D) Rhod 3123 bp, and E) Rhod 3871 bp. Sso SSB Tzi polymerase fusion protein amplified all targets with the greatest yields.

DETAILED DESCRIPTION OF THE INVENTION

The invention is based, in part, on the surprising discovery that yield and/or fidelity of a DNAP can be improved via fusion of a single strand DNA binding protein (SSB) with a DNA polymerase. Described herein are such SSB-DNA polymerase fusion proteins, as well as methods for nucleic acid sequencing, amplification (e.g., PCR) and synthesis using such SSB-DNA polymerase fusion proteins.

DEFINITIONS

The following terms are commonly used by those skilled in the art of molecular biology.

Nucleic acid. In general, a nucleic acid comprises a contiguous series (a.k.a., “strand” and “sequence”) of nucleotides joined by phosphodiester bonds. A nucleic acid can be single stranded or double stranded, where two strands are linked via noncovalent interactions between complementary nucleotide bases. A nucleic acid can include naturally occurring nucleotides and/or non-naturally occurring base moieties). A nucleic acid can be ribonucleic acid (RNA, including mRNA) or deoxyribonucleic acid (DNA, including genomic DNA, recombinant DNA, cDNA and synthetic DNA). A nucleic acid can be a discrete molecule such as a chromosome or cDNA molecule. A nucleic acid can also be a segment (i.e. a series of nucleotides connected by phosphodiester bonds) of a discrete molecule.

Template. A template is a single stranded nucleic acid that, when part of a primer-template complex, can serve as a substrate for a DNA polymerase. The template can be DNA (for DNA-directed DNA polymerase) or RNA (for RNA-directed DNA polymerase). A nucleic acid synthesis mixture can include a single type of template, or can include templates having different nucleotide sequences. By using primers specific for particular templates, primer extension products can be made for a plurality of templates in a nucleic acid synthesis mixture. The plurality of templates can be present within different discrete nucleic acids, or can be present within a discrete nucleic acid.

Templates can be obtained, or can be prepared from nucleic acids present in biological sources. (e.g. cells, tissues, body fluids, organs and organisms). Thus, templates can be obtained, or can be prepared from nucleic acids present in bacteria (e.g. species of Escherichia, Bacillus, Serratia, Salmonella, Staphylococcus, Streptococcus, Clostridium, Chlamydia, Neisseria, Treponema, Mycoplasma, Borrelia, Legionella, Pseudomonas, Mycobacterium, Helicobacter, erwinia, Agrobacterium, Rhizobium and Streptomyces), fungi such as yeasts, viruses (e.g., Orthomyxoviridae, Paramyxoviridae, Herpesviridae, Picornaviridae, Hepadnaviridae, Retroviridae), protozoa, plants and animals (e.g., insects such as Drosophila app., nematodes such as C. elegans, fish, birds, rodents, porcines, equines, felines, canines and primates, including humans. Templates can also be obtained, or can be prepared from, nucleic acids present in environmental samples such as soil, water and air samples. Nucleic acids can be prepared from such biological and environmental sources using routine methods known by those of skill in the art.

In some embodiments, a template is obtained directly from a biological or environmental source. In other embodiments, a template is provided by wholly or partially denaturing a double-stranded nucleic acid obtained from a biological or environmental source. In some embodiments, a template is a recombinant or synthetic DNA molecule. Recombinant or synthetic DNA can be single stranded or double stranded. If double stranded, the template may be wholly or partially denatured to provide a template. In some embodiments, the template is an mRNA molecule or population of mRNA molecules. In other embodiments, the template is a cDNA molecule of a population of cDNA molecules. A cDNA template can be synthesized in a nucleic acid synthesis reaction by an enzyme having reverse transcriptase activity, or can be provided from an extrinsic source (e.g., a cDNA library).

Primer. A primer is a single stranded nucleic acid that is shorter than a template, and is complementary to a segment of a template. A primer can hybridize to a template to form a primer-template complex (i.e., a primed template) such that a DNAP can synthesize a nucleic acid molecule (i.e., primer extension product) that is complementary to all or a portion of a template.

Primers typically are 12 to 60 nucleotides long (e.g. 18 to 45 nucleotides long), although they may be shorter or longer in length. A primer is designed to be substantially complementary to a cognate template such that it can specifically hybridize to the template to form a primer-template complex that can serve as a substrate for DNAP to make a primer extension product. In some primer-template complexes, the primer and template are exactly complementary such that each nucleotide of a primer is complementary to and interacts with a template nucleotide. Primers can be made by methods well known in the art (e.g. using an ABI DNA Synthesizer from Applied Biosystems or a Biosearch 8600 or 8800 Series Synthesizer from Milligen-Biosearch, Inc.), or can be obtained from a number of commercial vendors.

DNA polymerase (DNAP). A DNA polymerase is an enzyme that can add deoxynucleoside monophosphate molecules to the 3′ hydroxy end of a primer in a primer-template complex, and then sequentially to the 3′ hydroxy end of a growing primer extension product according to an RNA or DNA template that directs the synthesis of the polynucleotide. For example, a DNA polymerase can synthesise the formation of a DNA molecule complementary to a single-stranded DNA or RNA template by extending a primer in the 5′-to-3′ direction. DNAPs include DNA-dependent DNA polymerases and RNA-dependent DNA polymerases. A given DNAP may have more than one polymerase activity. For example, some DNA-dependent DNA polymerases, such as Taq, also exhibit RNA-directed DNAP activity. DNAPs typically add nucleotides that are complementary to the template being used, but DNAPs may add noncomplementary nucleotides (mismatches) during the polymerization or synthesis process. Thus, the synthesized nucleic acid strand may not be completely complementary to the template. DNAPs may also make nucleic acid molecules that are shorter in length than the template used. DNAPs have two preferred substrates: one is the primer-template complex where the primer terminus has a free 3′-hydroxyl group, the other is a deoxynucleotide 5′-triphosphate (dNTP). A phosphodiester bond is formed by nucleophilic attack of the 3′-OH of the primer terminus on the α-phosphate group of the dNTP and elimination of the terminal pyrophosphate. DNAPs can be isolated from organisms as a matter of routine by those skilled in the art, and can be obtained from a number of commercial vendors.

Some DNAPs are thermostable, and are not substantially inactivated at temperatures commonly used in PCR-based nucleic acid synthesis. Such temperatures vary depending upon reaction parameters, including pH, template and primer nucleotide composition, primer length, and salt concentration. Thermostable DNAPs include Thermus thermophilus (Tth) DNAP, Thermus aquaticus (Taq) DNAP, Thermotoga neopolitana (Tne) DNAP, Thermotoga maritima (Tma) DNAP, Thermatoga strain FjSS3-B.1 DNAP, Thermococcus litoralis (Tli or VENT) DNAP, Pyrococcus furiosus (Pfu) DNAP, DEEPVENT™ DNAP, Pyrococcus woosii (Pwo) DNAP, Pyrococcus sp KOD2 (KOD) DNAP, Bacillus sterothermophilus (Bst) DNAP, Bacillus caldophilus (Bca) DNAP, Sulfolobus acidocaldarius (Sac) DNAP, Thermoplasma acidophilum (Tac) DNAP, Thermus flavus (Tfl/Tub) DNAP, Thermus ruber (Tru) DNAP, Thermus brockianus (DYNAZYME™) DNAP, Thermosipho africanus DNAP, Thermococcus zilligi (Tzi) and mutants, variants and derivatives thereof (see e.g., U.S. Pat. No. 6,077,664; U.S. Pat. No. 5,436,149; U.S. Pat. No. 4,889,818; U.S. Pat. No. 5,532,600; U.S. Pat. No. 4,965,188; U.S. Pat. No. 5,079,352; U.S. Pat. No. 5,614,365; U.S. Pat. No. 5,374,553; U.S. Pat. No. 5,270,179; U.S. Pat. No. 5,047,342; U.S. Pat. No. 5,512,462; WO 94/26766; WO 92/06188; WO 92/03556; WO 89/06691; WO 91/09950; 91/09944; WO 92/06200; WO 96/10640; WO 97/09451; PCT WO03/025132; U.S. Provisional Patent Application Ser. No. 60/647,408, filed Jan. 28, 2005; Barnes, W. Gene 112:29-35 (1992); Lawyer, F. et al (1993) PCR Meth. Appl. 2:275-287; and Flaman, J. et al. (1994) Nucl. Acids Res. 22:3259-3260). Other DNAPs are mesophilic, including pol I family DNAPs (e.g., DNAPs from E. coli, H. influenzae, D. radiodurans, H. pylori, C. aurantiacus, R. Prowazekii, T. pallidum, Synechocysis sp., B. subtilis, L. lactis, S. pneumoniae, M. tuberculosis, M. leprae, M. smegmatis, Bacteriophage L5, phi-C31, T7, T3, T5, SP01, SP02, S. cerevisiae, and D. melanogaster), pol III type DNAPs, and mutants, variants and derivatives thereof.

RNA-directed DNA polymerases (reverse transcriptases) are enzymes having reverse transcriptase activity (i.e., that catalyze synthesis of DNA from a single-stranded RNA template). Such enzymes include, but are not limited to, retroviral reverse transcriptase, retrotransposon reverse transcriptase, hepatitis B reverse transcriptase, cauliflower mosaic virus reverse transcriptase, bacterial reverse transcriptase, Tth DNA polymerase, Taq DNA polymerase (Saiki, R. K., et al. (1988) Science 239:487-491; U.S. Pat. Nos. 4,889,818 and 4,965,188), Tne DNA polymerase (WO 96/10640 and WO 97/09451), Tma DNA polymerase (U.S. Pat. No. 5,374,553) and mutants, variants or derivatives thereof (see e.g., WO 97/09451 and WO 98/47912). Some RTs have reduced, substantially reduced or eliminated RNase H activity. By an enzyme “substantially reduced in RNase H activity” is meant that the enzyme has less than about 20%, more preferably less than about 15%, 10% or 5%, and most preferably less than about 2%, of the RNase H activity of the corresponding wild type or RNase H+ enzyme such as wild type Moloney Murine Leukemia Virus (M-MLV), Avian Myeloblastosis Virus (AMV) or Rous Sarcoma Virus (RSV) reverse transcriptases. The RNase H activity of any enzyme may be determined by a variety of assays, such as those described, for example, in U.S. Pat. No. 5,244,797, in Kotewicz, M. L., et al. (1988) Nucl. Acids Res. 16:265 and in Gerard, G. F., et al. (1992) FOCUS 14:91. Particularly preferred polypeptides for use in the invention include, but are not limited to, M-MLV H− reverse transcriptase, RSV H− reverse transcriptase, AMV H− reverse transcriptase, RAV (rous-associated virus) H-reverse transcriptase, MAV (myeloblastosis-associated virus) H− reverse transcriptase and HIV H− reverse transcriptase (see U.S. Pat. No. 5,244,797 and WO 98/47912). It will be understood by one of skill in the art that any enzyme capable of producing a DNA molecule from a ribonucleic acid molecule (i.e., having reverse transcriptase activity) may be equivalently used in the compositions, methods and kits of the invention.

Nucleotide. A nucleotide consists of a phosphate group linked by a phosphoester bond to a pentose (ribose in RNA, and deoxyribose in DNA) that is linked in turn to an organic base. The monomeric units of a nucleic acid are nucleotides. Naturally occurring DNA and RNA each contain four different nucleotides: nucleotides having adenine, guanine, cytosine and thymine bases are found in naturally occurring DNA, and nucleotides having adenine, guanine, cytosine and uracil bases found in naturally occurring RNA. The bases adenine, guanine, cytosine, thymine, and uracil often are abbreviated A, G, C, T and U, respectively.

Nucleotides include free mono-, di- and triphosphate forms (i.e., where the phosphate group has one, two or three phosphate moieties, respectively). Thus, nucleotides include ribonucleoside triphosphates (e.g., ATP, UTP, CTG and GTP) and deoxyribonucleoside triphosphates (e.g., dATP, dCTP, dITP, dGTP and dTTP), and derivatives thereof. Nucleotides also include dideoxyribonucleoside triphosphates (ddNTPs, including ddATP, ddCTP, ddGTP, ddITP and ddTTP), and derivatives thereof.

Nucleotide derivatives include [αS]dATP, 7-deaza-dGTP, 7-deaza-dATP, and nucleotide derivatives that confer resistance to nucleolytic degradation. Nucleotide derivatives include nucleotides that are detectably labeled, e.g., with a radioactive isotope such as 32P or 35S, a fluorescent moiety, a chemiluminescent moiety, a bioluminescent moiety or an enzyme.

Primer extension product. A primer extension product is a nucleic acid that includes a primer to which DNAP has added one or more nucleotides. Primer extension products can be as long as, or shorter than the template of a primer-template complex.

Amplifying. Amplifying refers to an in vitro method for increasing the number of copies of a nucleic acid with the use of a DNAP. Nucleic acid amplification results in the addition of nucleotides to a primer or growing primer extension product to form a new molecule complementary to a template. In nucleic acid amplification, a primer extension product and its template can be denatured and used as templates to synthesize additional nucleic acid molecules. An amplification reaction can consist of many rounds of replication (e.g., one PCR may consist of 5 to 100 “cycles” of denaturation and primer extension). General methods for amplifying nucleic acids are well-known to those of skill in the art (see e.g., U.S. Pat. Nos. 4,683,195; 4,683,202; and 4,800,159; Innis, M. A., et al., eds., PCR Protocols: A Guide to Methods and Applications, San Diego, Calif.: Academic Press, Inc. (1990); Griffin, H., and A. Griffin, eds., PCR Technology: Current Innovations, Boca Raton, Fla.: CRC Press (1994)). Amplification methods that can be used in accord with the present invention include PCR (U.S. Pat. Nos. 4,683,195 and 4,683,202), Strand Displacement Amplification (SDA; U.S. Pat. No. 5,455,166; EP 0 684 315), Nucleic Acid Sequenced-Based Amplification (NASBA; U.S. Pat. No. 5,409,818; EP 0 329 822).

Single stranded DNA binding protein (SSB). Single stranded DNA binding proteins (SSBs) are proteins that preferentially bind single stranded DNA (ssDNA) over double-stranded DNA in a nucleotide sequence independent manner. SSBs have been identified in virtually all known organisms, and appear to be important for DNA metabolism, including replication, recombination and repair. Naturally occurring SSBs typically are comprised of two, three or four subunits, which may be the same or different. In general, naturally occurring SSB subunits contains at least one conserved DNA binding domain, or “OB fold” (see e.g., Philipova, D. et al. (1996) Genes Dev. 10:2222-2233; and Murzin, A. (1993) EMBO J. 12:861-867), such that naturally occurring SSBs have four or more OB folds.

Thermostable SSBs bind ssDNA at 70° C. at least 70% (e.g., at least 80%, at least 85%, at least 90% and at least 95%) as well as they do at 37° C., and are better suited for PCR applications than are mesophilic SSBs. Thermostable SSBs can be obtained from archaea. Archaea are a group of microbes distinguished from eubacteria through 16S rDNA sequence analysis. Archaea can be subdivided into three groups: crenarchaeota, euryarchaeota and korarchaeota (see e.g., Woese, C. and G. Fox (1977) PNAS 74: 5088-5090; Woese, C. et al. (1990) PNAS 87: 4576-4579; and Barns, S. et al. (1996) PNAS 93:9188-9193). Recently, there have been reports on the identification and characterization of euryarchaeota SSBs, including Methanococcus jannachii SSB, Methanobacterium thermoautrophicum SSB, and Archaeoglobus fulgidus SSB, as well as crenarchaeota SSBs, including Sulfolobus sulfataricus SSB and Aeropyrum pernix SSB (see e.g., Chedin, F. et al. (1998) Trends Biochem. Sci. 23:273-277; Haseltine C. et al. (2002) Mol. Microbiol. 43:1505-1515; Kelly, T. et al. (1998) Proc. Natl. Acad. Sci. USA 95:14634-14639; Klenk, H. et al. (1997) Nature 390:364-370; Smith, D. et al. (1997) J. Bacteriol. 179:7135-55; Wadsworth, R. and M. White (2001) Nucl. Acids Res. 29:914-920; and in U.S. Patent Application 60/147,680.

Ordinarily skilled artisans can purify SSBs (including archaea SSBs), make recombinant variants, and can measure SSB activity using routine methods, such as those disclosed in Haseltine C. et al. (2002) Mol. Microbiol. 43:1505-1515. A non-comprehensive list of known SSBs, with GenBank Accession numbers, is provided in Table 1. Any of these SSBs, including homologs and variants thereof, may be used in the present invention.

TABLE 1 gi|18978392 Replication factor A related protein [Pyrococcus furiosus DSM 3638] gi|15679384 Replication factor A related protein [Methanothermobacter thermautotrophicus] [Methanothermobacter thermautotrophicus str. Delta H] gi|15679383 Replication factor A related protein [Methanothermobacter thermautotrophicus] [Methanothermobacter thermautotrophicus str. Delta H] gi|15669348 Replication factor A related protein [Methanococcus jannaschii] [Methanocaldococcus jannaschii] gi|14520503 Replication factor A related protein [Pyrococcus abyssi] gi|2622495 Replication factor A related protein [Methanothermobacter thermautotrophicus str. Delta H] gi|2622494 Replication factor A related protein [Methanothermobacter thermautotrophicus str. Delta H] gi|18894230 Replication factor A related protein [Pyrococcus furiosus DSM 3638] gi|7521609 Replication factor A related protein PAB2163 - Pyrococcus abyssi (strain Orsay) gi|7482812 Replication factor A related protein - Methanobacterium thermoautotrophicum (strain Delta H) gi|7482811 Replication factor A related protein - Methanobacterium thermoautotrophicum (strain Delta H) gi|5457718 Replication factor A related protein [Pyrococcus abyssi] gi|1500014 Replication factor A related protein [Methanococcus jannaschii] [Methanocaldococcus jannaschii] gi|22299033 Single-stranded DNA-binding protein [Thermosynechococcus elongatus BP-1] gi|17545141 Single-strand Binding Protein (Helix Destabilizing Protein) [Ralstonia solanacearum] gi|15807618 Single-stranded DNA-binding protein [Deinococcus radiodurans] gi|15645859 Single-strand DNA-binding protein (ssb) [Helicobacter pylori 26695] gi|15616611 Single-strand DNA-binding protein (phage-related protein) [Bacillus halodurans] gi|21233884 Single-strand DNA binding protein [Proteus vulgaris] gi|21233779 Single-strand DNA binding protein [Proteus vulgaris] gi|21233694 Single-strand DNA binding protein [Proteus vulgaris] gi|21203068 Single-strand DNA binding protein [Proteus vulgaris] gi|21202963 Single-strand DNA binding protein [Proteus vulgaris] gi|21202878 Single-strand DNA binding protein [Proteus vulgaris] gi|16767506 ssDNA-binding protein controls activity of RecBCD nuclease [Salmonella typhimurium LT2] gi|19746763 Single strand binding protein [Streptococcus pyogenes MGAS8232] gi|19746681 Single strand binding protein [Streptococcus pyogenes MGAS8232] gi|19745475 Single strand binding protein [Streptococcus pyogenes MGAS8232] gi|19745296 Single-strand binding protein [Streptococcus pyogenes MGAS8232] gi|22295215 Single-stranded DNA-binding protein [Thermosynechococcus elongatus BP-1] gi|21325755 Single-stranded DNA-binding protein [Corynebacterium glutamicum ATCC 13032] gi|21324632 Single-stranded DNA-binding protein [Corynebacterium glutamicum ATCC 13032] gi|205544 Single-stranded DNA binding protein precursor [Rattus sp.] gi|22124496 ssDNA-binding protein [Yersinia pestis KIM] gi|586039 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|417811 Single-stranded DNA-binding protein, mitochondrial precursor (Mt-SSB) (MtSSB) (PWP1-interacting protein 17) gi|17137188 Clp-P1; Single stranded-binding protein c6a [Drosophila melanogaster] gi|17137156 Ssb-c31a-P1 [Drosophila melanogaster] gi|16422814 ssDNA-binding protein [Salmonella typhimurium LT2] gi|21957289 ssDNA-binding protein [Yersinia pestis KIM] gi|18249854 Single-stranded DNA binding protein [Aster yellows phytoplasma] gi|17981729 Mitochondrial single stranded DNA-binding protein; low power [Drosophila melanogaster] gi|10955315 Single-strand binding protein [Escherichia coli] gi|9507481 Single-stranded DNA binding protein [Plasmid ColIb-P9] gi|21911117 Single strand DNA binding protein [Streptococcus pyogenes MGAS315] gi|21905327 Single strand DNA binding protein [Streptococcus pyogenes MGAS315] gi|21885285 Single-stranded DNA binding protein [Vibrio cholerae] gi|16751957 Single-stranded DNA binding protein [Plasmid pIPO2T] gi|16610025 Single-stranded DNA binding protein [Plasmid pIPO2T] gi|6968505 Single-strand DNA binding protein [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|18146307 Phage-related single-strand DNA-binding protein [Clostridium perfringens str. 13] gi|18143945 Phage-related single-strand DNA binding protein [Clostridium perfringens str. 13] gi|9626285 Single-stranded DNA binding protein [Bacteriophage lambda] gi|21686516 Single-stranded DNA-binding protein [Arthrobacter aurescens] gi|21672790 Single-strand binding protein [Buchnera aphidicola str. Sg (Schizaphis graminum)] gi|21203507 Single-strand DNA-binding protein of phage phi Sa 2 mw [Staphylococcus aureus subsp. aureus MW2] gi|13700280 Single-strand DNA-binding protein of phage phi PVL [Staphylococcus aureus subsp. aureus N315] gi|21628947 Single-strand DNA binding (helix-destabilizing) protein [Haemophilus influenzae biotype aegyptius] gi|21623439 Single-strand binding protein [Buchnera aphidicola str. Sg (Schizaphis graminum)] gi|21243632 Single-stranded DNA binding protein [Xanthomonas axonopodis pv. citri str. 306] gi|21242946 Single-stranded DNA binding protein [Xanthomonas axonopodis pv. citri str. 306] gi|20809109 Single-stranded DNA-binding protein [Thermoanaerobacter tengcongensis] gi|20808452 Single-stranded DNA-binding protein [Thermoanaerobacter tengcongensis] gi|20807311 Single-stranded DNA-binding protein [Thermoanaerobacter tengcongensis] gi|21591574 Single-strand DNA binding (helix-destabilizing) protein [Haemophilus influenzae biotype aegyptius] gi|17935411 Single-strand DNA binding protein [Agrobacterium tumefaciens str. C58] gi|16272208 Single-stranded DNA binding protein (ssb) [Haemophilus influenzae Rd] gi|16131885 ssDNA-binding protein [Escherichia coli K12] gi|15834295 ssDNA-binding protein [Escherichia coli O157:H7] gi|15804651 ssDNA-binding protein [Escherichia coli O157:H7 EDL933] gi|18311623 Phage-related single-strand DNA-binding protein [Clostridium perfringens] gi|18309269 Phage-related single-strand DNA binding protein [Clostridium perfringens] gi|16802093 Single-strand binding protein (SSB) [Listeria monocytogenes EGD-e] gi|16799117 Single-strand binding protein (SSB) [Listeria innocua] gi|16763010 Single strand binding protein [Salmonella enterica subsp. enterica serovar Typhi] gi|16762936 Single-strand DNA-binding protein [Salmonella enterica subsp. enterica serovar Typhi] gi|16332050 Single-stranded DNA-binding protein [Synechocystis sp. PCC 6803] gi|16120662 Single-strand binding protein [Yersinia pestis] gi|16081142 Single-strand DNA-binding protein [Bacillus subtilis] gi|15965311 Single-strand binding protein [Sinorhizobium meliloti] gi|15926067 Single-strand DNA-binding protein of phage phi PVL [Staphylococcus aureus subsp. aureus N315] gi|15923356 Single-strand DNA-binding protein of phage phi PVL [Staphylococcus aureus subsp. aureus Mu50] gi|15899120 Single-stranded DNA binding protein (SSB) [Sulfolobus solfataricus] gi|15896954 Single strand DNA binding protein, SSB [Clostridium acetobutylicum] gi|15895648 Single-strand DNA-binding protein, ssb [Clostridium acetobutylicum] gi|15895193 Phage related SSB-like protein [Clostridium acetobutylicum] gi|15894232 Single-stranded DNA-binding protein [Clostridium acetobutylicum] gi|15893218 Single-strand binding protein [Rickettsia conorii] gi|15835919 SS DNA binding protein [Chlamydophila pneumoniae J138] gi|15829081 Single-strand DNA binding protein (SSB) (Helix destabilizing protein) [Mycoplasma pulmonis] gi|15828449 Single strand binding protein [Mycobacterium leprae] gi|15794566 Single-strand binding protein [Neisseria meningitidis Z2491] gi|15792396 Single-strand binding protein [Campylobacter jejuni] gi|15618301 SS DNA Binding Protein [Chlamydophila pneumoniae CWL029] gi|15617138 Single-strand binding protein [Buchnera sp. APS] gi|15612231 Single-strand binding protein [Helicobacter pylori J99] gi|15607196 ssb [Mycobacterium tuberculosis H37Rv] gi|15605660 Single stranded DNA-binding protein [Aquifex aeolicus] gi|15604763 SS DNA Binding Protein [Chlamydia trachomatis] gi|15604667 Single-strand binding protein (ssb) [Rickettsia prowazekii] gi|15603815 Ssb [Pasteurella multocida] gi|15599428 Single-stranded DNA-binding protein [Pseudomonas aeruginosa] gi|13507968 Single-stranded DNA binding protein [Mycoplasma pneumoniae] gi|13358117 Single-strand binding protein [Ureaplasma urealyticum] gi|12044943 Single-stranded DNA-binding protein (ssb) [Mycoplasma genitalium] gi|21539818 Ssb [Lactococcus lactis subsp. cremoris] gi|15639056 Single-strand DNA binding protein (ssb) [Treponema pallidum] gi|15594460 Single-stranded DNA-binding protein (ssb) [Borrelia burgdorferi] gi|17865707 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|8478517 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|1174443 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|417647 Single-stranded DNA-binding protein RIM1, mitochondrial precursor (ssDNA-binding protein, mitochondrial) gi|138390 Single-stranded DNA binding protein (Helix-destabilizing protein) (Gp32) gi|134913 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|21400036 SSB, Single-strand binding protein family [Bacillus anthracis A2012] [Bacillus anthracis str. A2012] gi|21397955 SSB, Single-strand binding protein family [Bacillus anthracis A2012] [Bacillus anthracis str. A2012] gi|18920500 ssb [Staphylococcus aureus phage phi 11] gi|16505317 Single strand binding protein [Salmonella enterica subsp. enterica serovar Typhi] gi|16505243 Single-strand DNA-binding protein [Salmonella enterica subsp. enterica serovar Typhi] gi|16412459 Single-strand binding protein (SSB) [Listeria innocua] gi|16409404 Single-strand binding protein (SSB) [Listeria monocytogenes] gi|15978425 Single-strand binding protein [Yersinia pestis] gi|21232166 Single-stranded DNA binding protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|21282071 Single-strand DNA-binding protein of phage phi Sa 2mw [Staphylococcus aureus subsp. aureus MW2] gi|21222314 Single-strand DNA-binding protein [Streptomyces coelicolor A3(2)] gi|21221138 Single-strand DNA-binding protein [Streptomyces coelicolor A3(2)] gi|21109208 Single-stranded DNA binding protein [Xanthomonas axonopodis pv. citri str. 306] gi|21108448 Single-stranded DNA binding protein [Xanthomonas axonopodis pv. citri str. 306] gi|8978758 SS DNA binding protein [Chlamydophila pneumoniae J138] gi|21113919 Single-stranded DNA binding protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|20910891 Single-stranded DNA binding protein, mitochondrial precursor (MT-SSB) (MTSSB) (P16) [Mus musculus] gi|8052392 Single-strand DNA-binding protein [Streptomyces coelicolor A3(2)] gi|4808403 Single-strand DNA-binding protein [Streptomyces coelicolor A3(2)] gi|20517787 Single-stranded DNA-binding protein [Thermoanaerobacter tengcongensis] gi|20517069 Single-stranded DNA-binding protein [Thermoanaerobacter tengcongensis] gi|20515823 Single-stranded DNA-binding protein [Thermoanaerobacter tengcongensis] gi|15074901 SSB protein [Streptococcus pneumoniae bacteriophage MM1] gi|19748994 Single strand binding protein [Streptococcus pyogenes MGAS8232] gi|19748904 Single strand binding protein [Streptococcus pyogenes MGAS8232] gi|19747591 Single strand binding protein [Streptococcus pyogenes MGAS8232] gi|19747395 Single-strand binding protein [Streptococcus pyogenes MGAS8232] gi|6647829 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|13432209 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|1711533 Single-stranded DNA-binding protein, mitochondrial precursor (Mt-SSB) (MtSSB) gi|10956609 Single-strand binding protein homolog Ssb [Corynebacterium glutamicum] gi|19352383 Ssb protein [uncultured bacterium] gi|15088755 SSB protein [Streptococcus pneumoniae bacteriophage MM1] gi|19070050 Ssb protein [uncultured bacterium] gi|19032310 Single-stranded DNA-binding protein [Anabaena variabilis] gi|18920719 Single-strand binding protein Ssb [Bartonella bacilliformis] gi|11875133 Single-stranded DNA binding protein [Escherichia coli O157:H7] gi|8918883 Single-strand DNA binding protein [Plasmid F] gi|7649839 Ea10 protein; Ssb [Escherichia coli O157:H7] gi|5103190 Single strand DNA binding protein [Plasmid R100] gi|15919964 Ssb protein [Plasmid pSB102] gi|15722263 Ssb protein [Plasmid pSB102] gi|18654211 Single strand binding protein [Bacteriophage LL-H] gi|14246134 Single-strand DNA-binding protein of phage phi PVL [Staphylococcus aureus subsp. aureus Mu50] gi|14195223 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|11387162 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|6647831 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|11387134 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|6647828 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|6647827 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|6647825 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|6647824 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|6647823 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|6647820 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|6647819 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|2500889 Single-stranded DNA binding protein gi|1351118 Single-stranded DNA binding protein gi|730833 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|134905 Single-strand binding protein (SSB) (Helix-destabilizing protein) gi|4507231 Single-stranded DNA-binding protein 1 [Homo sapiens] gi|14794570 Ssb [Cloning vector pRK310] gi|18150888 SSB protein [Pseudomonas putida] gi|18143627 Single-stranded DNA binding protein [Aster yellows phytoplasma] gi|18104278 ssb protein [Enterococcus faecalis] gi|18104262 ssb protein [Enterococcus faecalis] gi|18077129 SSB protein [Pseudomonas putida] gi|17977995 Single stranded DNA-binding protein SSB [Escherichia coli] gi|17864928 SSB-like protein [Haemophilus influenzae biotype aegyptius] gi|17739937 Single-strand DNA binding protein [Agrobacterium tumefaciens str. C58] gi|9507773 Single-strand DNA binding protein [Plasmid F] gi|9507591 Single strand DNA binding protein [Plasmid R100] gi|17427432 Single-strand binding protein (helix destabilizing protein) [Ralstonia solanacearum] gi|17381298 SSB protein [uncultured bacterium] gi|13561952 Single-stranded DNA-binding protein [Mycobacterium smegmatis] gi|12830947 SSB [bacteriophage bIL286] gi|12830884 SSB protein [bacteriophage bIL285] gi|5001700 Single-strand binding protein; SSB [Bacteriophage Tuc2009] gi|82212 ssb protein homolog - common tobacco chloroplast gi|13786543 SSB [Lactococcus lactis bacteriophage TP901-1] gi|13661686 SSB [Lactococcus lactis bacteriophage TP901-1] gi|13095695 SSB protein [bacteriophage bIL285] gi|12829834 Single stranded binding protein [Lactococcus lactis bacteriophage TP901-1] gi|12248112 SSB [Bacillus phage GA-1] gi|9632484 Single-stranded DNA binding protein [Bacteriophage 933W] gi|16973267 ssb protein [uncultured bacterium] gi|16798847 SSB protein [Bacteriophage A118] gi|13487814 Single-strand binding protein; SSB [Bacteriophage Tuc2009] gi|13095758 SSB [bacteriophage bIL286] gi|12141282 SSB [Bacillus phage GA-1] gi|7960759 Single-stranded DNA binding protein [Bacillus phage Nf] gi|6094357 Single-strand binding protein (SSB) (EARLY PROTEIN GP5) gi|6094356 Single-strand binding protein (SSB) (EARLY PROTEIN GP5) gi|5823662 SSB protein [Bacteriophage A118] gi|5354247 ssb; helix-destabilizing [Enterobacteria phage T4] gi|4426959 Single-stranded DNA-binding protein SSB-P1 [Enterobacteria phage P1] gi|4262664 SSB [Bacteriophage TuIb] gi|4262663 SSB [Bacteriophage Mi] gi|3915274 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915273 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915272 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915271 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915270 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915269 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915268 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915267 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915266 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915265 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915264 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915263 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915262 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915261 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915248 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|3915242 Single-stranded DNA binding protein (helix destabilizing protein) (GP32) gi|2645797 SSB [Bacteriophage SV76] gi|2645795 SSB [Bacteriophage RB69] gi|2645793 SSB [Bacteriophage RB32] gi|2645791 SSB [Bacteriophage RB27] gi|2645789 SSB [Bacteriophage RB18] gi|2645787 SSB [Bacteriophage RB15] gi|2645785 SSB [Bacteriophage RB10] gi|2645783 SSB [Bacteriophage RB9] gi|2645781 SSB [Bacteriophage RB8] gi|2645779 SSB [Bacteriophage RB6] gi|2645777 SSB [Bacteriophage RB3] gi|2645775 SSB [Bacteriophage PST] gi|2645773 SSB [Bacteriophage M1] gi|2645770 SSB [bacteriophage FS-alpha] gi|2645768 SSB [Enterobacteria phage SV14] gi|2645766 SSB [Bacteriophage RB70] gi|1429233 SSB [Bacteriophage B103] gi|138392 Helix-destabilizing protein (Single-stranded DNA-binding protein) (SSB protein) gi|138391 Single-stranded DNA binding protein (Helix-destabilizing protein) (GP32) gi|138389 Helix-destabilizing protein (Single-stranded DNA-binding protein) (SSB protein) gi|138388 Single-stranded DNA binding protein (Helix-destabilizing protein) (GP32) gi|138072 Single-strand binding protein (SSB) (Early protein GP5) gi|13937510 SSB protein [Pseudomonas sp. ADP] gi|15620434 Single-strand binding protein [Rickettsia conorii] gi|1568593 ssb [Mycobacterium tuberculosis H37Rv] gi|10955209 SSB [Enterobacter aerogenes] gi|1572546 SSB [Enterobacter aerogenes] gi|15026829 Single-strand DNA-binding protein, SSB [Clostridium acetobutylicum] gi|15025394 Single-strand DNA-binding protein, ssb [Clostridium acetobutylicum] gi|15024899 Phage related SSB-like protein [Clostridium acetobutylicum] gi|13815667 Single-stranded DNA binding protein (SSB) [Sulfolobus solfataricus] gi|9837391 Ssb [Flavobacterium johnsoniae] gi|14090025 Single-strand binding protein (SSB) (Helix-destabilizing protein) [Mycoplasma pulmonis] gi|13992542 Single-stranded DNA binding [Oryctolagus cuniculus] gi|13774090 Single-stranded DNA binding protein [Aster yellows phytoplasma] gi|13661656 Single strand binding protein Ssb [Comamonas testosteroni] gi|12519013 ssDNA-binding protein [Escherichia coli O157:H7 EDL933] gi|13364518 ssDNA-binding protein [Escherichia coli O157:H7] gi|4115492 Single strand binding protein [Phytoplasma sp.] gi|12722386 Ssb [Pasteurella multocida] gi|10954410 Single strand binding protein [Actinobacillus actinomycetemcomitans] gi|10880887 Single strand binding protein [Actinobacillus actinomycetemcomitans] gi|13093879 Single strand binding protein [Mycobacterium leprae] gi|4583407 Single-strand binding protein homolog Ssb [Corynebacterium glutamicum] gi|10176674 Single-strand DNA-binding protein (phage-related protein) [Bacillus halodurans] gi|7380314 Single-stranded binding protein [Neisseria meningitidis Z2491] gi|4376665 SS DNA Binding Protein [Chlamydophila pneumoniae CWL029] gi|1790494 ssDNA-binding protein [Escherichia coli K12] gi|7428645 Single-stranded DNA-binding protein 1 precursor, mitochondrial - African clawed frog gi|1674304 Single-stranded DNA binding protein [Mycoplasma pneumoniae] gi|7439948 Single-strand binding protein (ssb) RP836 - Rickettsia prowazekii gi|7439930 ssb protein - Mycobacterium tuberculosis (strain H37RV) gi|7439921 Single-stranded DNA-binding protein 2 precursor, mitochondrial - African clawed frog gi|2146650 Single-stranded DNA-binding protein ssb - Mycoplasma pneumoniae (strain ATCC 29342) gi|2127217 Single-stranded DNA-binding protein ssb - Bacillus subtilis gi|2120579 Single-stranded DNA-binding protein - Brucella abortus gi|2119790 Excinuclease ABC chain A - Brucella abortus (fragment) gi|423723 Single-stranded mitochondrial DNA-binding protein precursor - rat gi|423082 Single-stranded mitochondrial DNA-binding protein precursor - human gi|96089 Helix-destabilizing protein - plasmid RK2 gi|3328436 SS DNA Binding Protein [Chlamydia trachomatis] gi|6899559 Single-strand binding protein [Ureaplasma urealyticum] gi|7297359 Ssb-c31a gene product [Drosophila melanogaster] gi|9954966 Chain D, Crystal Structure Of Chymotryptic Fragment Of E. Coli Ssb Bound To Two 35-Mer Single Strand Dnas gi|9954965 Chain C, Crystal Structure Of Chymotryptic Fragment Of E. Coli Ssb Bound To Two 35-Mer Single Strand Dnas gi|9954964 Chain B, Crystal Structure Of Chymotryptic Fragment Of E. Coli Ssb Bound To Two 35-Mer Single Strand Dnas gi|9954963 Chain A, Crystal Structure Of Chymotryptic Fragment Of E. Coli Ssb Bound To Two 35-Mer Single Strand Dnas gi|10039203 Single-strand binding protein [Buchnera sp. APS] gi|9950448 Single-stranded DNA-binding protein [Pseudomonas aeruginosa] gi|9230773 Single stranded DNA-binding protein [Thermus aquaticus] gi|6841054 Single-stranded DNA-binding protein [Borrelia hermsii] gi|8569292 Chain D, Crystal Structure Analysis Of Single Stranded Dna Binding Protein (Ssb) From E. Coli gi|8569291 Chain C, Crystal Structure Analysis Of Single Stranded Dna Binding Protein (Ssb) From E. Coli gi|8569290 Chain B, Crystal Structure Analysis Of Single Stranded Dna Binding Protein (Ssb) From E. Coli gi|8569289 Chain A, Crystal Structure Analysis Of Single Stranded Dna Binding Protein (Ssb) From E. Coli gi|8548923 Single stranded binding protein [Thermus thermophilus] gi|7388261 Single-strand binding protein (SSB) (Helix destabilizing protein) gi|2815500 Single-strand DNA-binding protein R, mitochondrial precursor (MT-SSB-R) (MT-SSB 2) gi|586040 Single-strand binding protein (SSB) (Helix destabilizing protein) gi|417812 Single-strand DNA-binding protein, mitochondrial precursor (MT-SSB) (MTSSB) (P16) gi|134916 Single-strand binding protein (SSB) (Helix destabilizing protein) gi|134914 Single-strand binding protein (SSB) (Helix destabilizing protein) gi|134912 Single-strand DNA-binding protein S, mitochondrial precursor (MT-SSB-S) (MT-SSB 1) gi|134910 Single-strand binding protein (SSB) (Helix destabilizing protein) gi|134906 Single-strand binding protein (SSB) (Helix destabilizing protein) gi|134904 Single-strand binding protein (SSB) (Helix destabilizing protein) gi|134903 Single-strand binding protein (SSB) (Helix destabilizing protein) gi|6513859 Single strand binding protein [Salmonella typhi] gi|7439942 Single-stranded DNA-binding protein (ssb) homolog - Lyme disease spirochete gi|7439928 Single-strand DNA binding protein (ssb) - syphilis spirochete gi|7428646 Single-stranded DNA-binding protein - Escherichia coli gi|1361850 Single-stranded DNA binding protein ssb homolog - Mycoplasma genitalium gi|484396 Single-stranded DNA-binding protein - Serratia marcescens gi|70818 Single-stranded DNA-binding protein - Escherichia coli plasmid F gi|70817 Single-stranded DNA-binding protein - Escherichia coli plasmid ColIb-P9 gi|7264824 Single-stranded DNA-binding protein [Escherichia coli] gi|3114758 Single strand DNA binding protein [Campylobacter jejuni] gi|6739548 SSB protein [Thermus thermophilus] gi|466378 SSB [Plasmid R751] gi|2735512 SSB [Staphylococcus carnosus] gi|4688844 SSB protein [Escherichia coli] gi|6066193 Single strand binding protein [Sinorhizobium meliloti] gi|6015512 SSB-like protein [unidentified] gi|2959411 Single-stranded binding protein [Mycobacterium leprae] gi|5702178 Single stranded DNA binding protein [Escherichia coli] gi|3337047 Single-strand binding protein [Escherichia coli] gi|4585395 Single-stranded DNA binding protein [Bacteriophage 933W] gi|2314411 Single-strand DNA-binding protein (ssb) [Helicobacter pylori 26695] gi|4512478 Single-stranded DNA binding protein [Plasmid ColIb-P9] gi|4377534 ssb protein [Escherichia coli] gi|3851548 Single strand DNA-binding protein; SSB [Vibrio cholerae] gi|4261534 Single-stranded DNA binding protein [Saccharomyces cerevisiae] gi|4155774 Single-strand binding protein [Helicobacter pylori J99] gi|4099056 Single-stranded DNA binding protein [Rhodobacter sphaeroides] gi|3861362 Single-strand binding protein (ssb) [Rickettsia prowazekii] gi|3844678 Single-stranded DNA-binding protein (ssb) [Mycoplasma genitalium] gi|3822198 Single strand binding protein [Escherichia coli O157:H7] gi|2780888 Chain D, Structure Of Single Stranded Dna Binding Protein (Ssb) gi|2780887 Chain C, Structure Of Single Stranded Dna Binding Protein (Ssb) gi|2780886 Chain B, Structure Of Single Stranded Dna Binding Protein (Ssb) gi|2780885 Chain A, Structure Of Single Stranded Dna Binding Protein (Ssb) gi|2687989 Single-stranded DNA-binding protein (ssb) [Borrelia burgdorferi] gi|3322320 Single-strand DNA binding protein (ssb) [Treponema pallidum] gi|1502417 Single-stranded DNA binding protein p12 subunit [Schizosaccharomyces pombe] gi|1502415 Single-stranded DNA binding protein p30 subunit [Schizosaccharomyces pombe] gi|1502413 Single-stranded DNA binding protein p68 subunit [Schizosaccharomyces pombe] gi|396394 Single-strand DNA-binding protein [Escherichia coli] gi|3600051 Similar to the single-strand binding proteins family (Pfam: SSB.hmm, score: 24.02) [Arabidopsis thaliana] gi|3323586 Single-strand binding protein [Salmonella typhimurium] gi|1573216 Single-stranded DNA binding protein (ssb) [Haemophilus influenzae Rd] gi|2982816 Single stranded DNA-binding protein [Aquifex aeolicus] gi|2636637 Single-strand DNA-binding protein [Bacillus subtilis] gi|467374 Single strand DNA binding protein [Bacillus subtilis] gi|104268 Single-stranded DNA-binding protein r - African clawed frog mitochondrion (SGC1) gi|104182 Single-stranded DNA-binding protein 1 precursor, mitochondrial - African clawed frog gi|1490785 Single stranded DNA-binding protein [Shewanella sp. SC2A] gi|1490783 Single stranded DNA-binding protein [Shewanella sp. F1A] gi|1490781 Single stranded DNA-binding protein [Shewanella sp. PT99] gi|1490779 Single stranded DNA-binding protein [Shewanella hanedai] gi|483597 Single-stranded DNA binding protein [Pseudomonas aeruginosa] gi|264475 SSb = 12 kda basic functional DNA binding region of 30 kda single-stranded nucleic-acid-specific acidic protein {N-terminal} [Pisum sativum = peas, cv. Arkel, Peptide Chloroplast Partial, 20 aa] gi|264474 SSB = 28 kda single-stranded nucleic-acid-specific acidic protein {N-terminal} [Pisum sativum = peas, cv. Arkel, Peptide Chloroplast Partial, 17 aa] gi|264473 SSB = 30 kda single-stranded nucleic-acid-specific acidic protein {N-terminal} [Pisum sativum = peas, cv. Arkel, Peptide Chloroplast Partial, 25 aa] gi|264472 SSB = 33 kda single-stranded nucleic-acid-specific acidic protein {N-terminal} [Pisum sativum = peas, cv. Arkel, Peptide Chloroplast Partial, 25 aa] gi|254074 Single-stranded DNA binding protein; SSB [Saccharomyces cerevisiae] gi|1097885 ssDNA-binding protein gi|225266 ssb-like ORF 273 gi|64899 mitochondrial DNA specific single-stranded DNA binding protein (mt-SSB) [Xenopus laevis] gi|47270 Single-stranded DNA-binding protein [Serratia marcescens] gi|45638 Single-stranded DNA-binding protein [Proteus mirabilis] gi|144656 Single-stranded DNA-binding protein [Plasmid ColIb-P9] gi|1107472 Single stranded DNA binding protein [Plasmid F] gi|662792 Single-stranded DNA binding protein [uncultured eubacterium] gi|507347 SSB [Haemophilus influenzae] gi|188856 Single stranded DNA binding protein [Homo sapiens] gi|552025 Single stranded DNA binding protein [Salmonella typhimurium] gi|147870 Single-strand DNA-binding protein (ssb) [Escherichia coli] gi|409951 Mitochondrial single-stranded DNA-binding protein [Drosophila melanogaster] gi|144126 Single stranded DNA binding protein [Brucella melitensis biovar Abortus]

Isolated. With respect to polypeptides, “isolated” refers to a polypeptide that constitutes a major component in a mixture of components, e.g., 30% or more, 40% or more, 50% or more, 60% or more, 70% or more, 80% or more, 90% or more, or 95% or more by weight. Isolated polypeptides typically are obtained by purification from an organism that contains the polypeptide (e.g., a transgenic organism that expresses the polypeptide), although chemical synthesis is also feasible. Methods of polypeptide purification include, for example, ammonium sulfate precipitation, chromatography and immunoaffinity techniques.

A polypeptide of the invention can be detected by any means known in the art, including sodium dodecyl sulfate (SDS)-polyacrylamide gel electrophoresis followed by Coomassie Blue-staining or Western blot analysis using monoclonal or polyclonal antibodies that have binding affinity for the polypeptide to be detected.

Thermostable. “Thermostable” refers to an enzyme or protein (e.g., DNAP, RT and SSB) that is resistant to inactivation by heat. In general, a thermostable enzyme is more resistant to heat inactivation than a mesophilic enzyme. Thus, the nucleic acid synthesis activity or single stranded binding activity of thermostable enzyme or protein may be reduced by heat treatment to some extent, but not as much as mesophilic enzyme or protein.

A thermostable DNAP retains at least 50% (e.g., at least 60%, at least 70%, at least 80%, at least 90%, and at least 95%) of its nucleic acid synthetic activity after being heated in a nucleic acid synthesis mixture at 90° C. for 30 seconds. In contrast, mesophilic DNAPs lose most of their nucleic acid synthetic activity after such heat treatment. Thermostable DNAPs typically also have a higher optimum nucleic acid synthesis temperature than the mesophilic T5 DNAP.

Thermostable SSBs bind ssDNA at 70° C. at least 70% (e.g., at least 80%, at least 85%, at least 90%, and at least 95%) as well as they do at 37° C. The degree to which an SSB binds ssDNA at such temperatures can be determined by measuring intrinsic SSB fluorescence. Intrinsic SSB fluorescence is related to conserved OB fold amino acids, and is quenched upon binding to ssDNA (see e.g., Alani, E. et al. (1992) J. Mol. Biol. 227:54-71). A routine protocol for determining SSB-ssDNA binding is described in Kelly, T. et al. (1998) Proc. Natl. Acad. Sci. USA 95:14634-14639. Briefly, SSB-ssDNA binding reactions are performed in 2 ml buffer containing 30 mM HEPES (pH 7.8), 100 mM NaCl, 5 mM MgCl2, 0.5% inositol and 1 mM DTT. A fixed amount of SSB is incubated with varying quantities of poly(dT), and fluorescence is measured using an excitation wavelength of about 295 nm and an emission wavelength of about 348 nm.

Fidelity. Fidelity refers to the accuracy of nucleic acid polymerization; the ability of DNAP to discriminate correct from incorrect substrates (e.g., nucleotides) when synthesizing nucleic acid molecules which are complementary to a template. The higher the fidelity, the less the enzyme misincorporates nucleotides in the growing strand during nucleic acid synthesis. Thus, an increase or enhancement in fidelity results in more faithful nucleic acid synthesis by DNAP or RT, with decreased misincorporation.

Increased/enhanced/higher fidelity means having an increase in fidelity, preferably about 1.2 to about 10,000 fold, about 1.5 to about 10,000 fold, about 2 to about 5,000 fold, or about 2 to about 2000 fold (preferably greater than about 5 fold, more preferably greater than about 10 fold, still more preferably greater than about 50 fold, still more preferably greater than about 100 fold, still more preferably greater than about 500 fold and most preferably greater than about 100 fold) reduction in the number of misincorporated nucleotides during synthesis of a nucleic acid of given length compared to the fidelity of a control DNAP (e.g., in the absence of SSBs) during nucleic acid synthesis.

Reduced misincorporation means less than 90%, less than 85%, less than 75%, less than 70%, less than 60%, or preferably less than 50%, preferably less than 25%, more preferably less than 10%, and most preferably less than 1% of relative misincorporation compared to a control DNAP (e.g., in the absence of SSBs) during nucleic acid polymerization.

Homologs and Variants. Homologs and variants suitable for the compositions and methods of the invention can be identified by homologous nucleotide and polypeptide sequence analyses. Known polypeptides in one organism can be used to identify homologous polypeptides in another organism. For example, performing a query on a database of nucleotide or polypeptide sequences can identify homologs of a known polypeptide. Homologous sequence analysis can involve BLAST or PSI-BLAST analysis of databases using known polypeptide amino acid sequences. Those proteins in the database that have greater than 35% sequence identity are candidates for further evaluation for suitability in the compositions and methods of the invention. If desired, manual inspection of such candidates can be carried out in order to narrow the number of candidates that can be further evaluated. Manual inspection is performed by selecting those candidates that appear to have domains conserved among known polypeptides

Vector. A vector is a nucleic acid such as a plasmid, cosmid, phage, or phagemid that can replicate autonomously in a host cell. A vector has one or a small number of sites that can be cut by a restriction endonuclease in a determinable fashion, and into which DNA can be inserted. A vector also can include a marker suitable for use in identifying hosts that contain the vector. Markers confer a recognizable phenotype on host cells in which such markers are expressed. Commonly used markers include antibiotic resistance genes such as those that confer tetracycline resistance or ampicillin resistance. Vectors also can contain sequences encoding polypeptides that facilitate the introduction of the vector into a host. Such polypeptides also can facilitate the maintenance of the vector in a host. “Expression vectors” include nucleic acid sequences that can enhance and/or regulate the expression of inserted DNA, after introduction into a host. Expression vectors contain one or more regulatory elements operably linked to a DNA insert. Such regulatory elements include promoter sequences, enhancer sequences, response elements, protein recognition sites, or inducible elements that modulate expression of a nucleic acid. As used herein, “operably linked” refers to positioning of a regulatory element in a vector relative to a DNA insert in such a way as to permit or facilitate transcription of the insert and/or translation of resultant RNA transcripts. The choice of element(s) included in an expression vector depends upon several factors, including, replication efficiency, selectability, inducibility, desired expression level, and cell or tissue specificity.

Host. The term “host” includes prokaryotes, such as E. coli, and eukaryotes, such as fungal, insect, plant and animal cells. Animal cells include, for example, COS cells and HeLa cells. Fungal cells include yeast cells, such as Saccharomyces cereviseae cells. A host cell can be transformed or transfected with a vector using techniques known to those of ordinary skill in the art, such as calcium phosphate or lithium acetate precipitation, electroporation, lipofection and particle bombardment. Host cells that contain a vector or portion thereof (a.k.a. “recombinant hosts”) can be used for such purposes as propagating the vector, producing a nucleic acid (e.g., DNA, RNA, antisense RNA) or expressing a polypeptide. In some cases, a recombinant host contains all or part of a vector (e.g., a DNA insert) on the host genome.

SSB-DNAP Fusion Proteins

An SSB-DNAP fusion protein may be constructed with the SSB portion at the N-terminus and the polymerase portion at the C-terminus or vice-versa. Thus, the DNA construct encoding the fusion protein may comprise the SSB portion upstream (5′) of the polymerase portion or vice versa. SSB genes are cloned upstream (or downstream) and in frame with a DNAP gene using methods well known in the art of molecular biology. In one embodiment, the DNAP is a DNA-directed DNA polymerase. In another embodiment, the DNAP is an RNA-directed DNA polymerase. The two portions may be immediately adjacent to each other, or may be separated by an amino acid linker. The amino acid linker may be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, 60, 70, 80, 90 or 100 or more amino acids in length. In one embodiment, the SSB(s) are thermostable. In some embodiments, nucleic acid synthesis compositions of the invention include two or more fusion proteins. In one embodiment, the SSBs are thermostable SSBs.

SSB-DNAP fusion proteins include, but are not limited to, polypeptides comprising, or consisting of, the amino acid sequence shown in SEQ ID NO: 2, and/or mutants, fragments, and variants thereof. Such fragments include those that retain substantial polymerase, 3′-5′ exonuclease activity and/or 5′-3′ exonuclease activity (e.g., at least 50%, 60%, 70%, 80%, 85%, 90%, 95% or 99% of the activity of the fusion protein shown in SEQ ID NO: 2). Although the Sso SSB and polymerase portions of the fusion protein shown in SEQ ID NO: 2 contain an amino acid linker separating the two portions, the two portions may also be joined in the absence of such an amino acid linker. The present invention also includes polynucleotides encoding such polypeptides (e.g., SEQ ID NO: 1), and mutants thereof including insertions, deletions and point mutations.

Expression and Purification of Fusion Proteins

To optimize expression of the fusion proteins described herein, inducible or constitutive promoters well known in the art may be used to control expression of a recombinant fusion protein gene in a recombinant host. Similarly, high or low copy number vectors, well known in the art, may be used to achieve appropriate levels of expression. Vectors having an inducible high copy number may also be useful to enhance expression of the fusion proteins in a recombinant host.

Prokaryotic vectors for constructing the plasmid library include plasmids such as those capable of replication in E. coli, including, but not limited to, pBR322, pET-26b(+), ColE1, pSC101, pUC vectors (pUC18, pUC19, etc., in Molecular Cloning, a Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989). Bacillus plasmids include pC194, pC221, pC217, etc. (Glyczan, in Molecular Biology Bacilli, Academic Press, New York, pp 307-329. 1982). Suitable Streptomyces plasmids include pIJ101 (Kendall et al., J. Bacteriol. 169:4177-4183, 1987). Pseudomonas plasmids are reviewed by John et al. (Rad. Insec. Dis. 8:693-704, 1986) and Igaki (Jpn. J. Bacteriol. 33:729-742, 1978). Broad-host range plasmids or cosmids, such as pCP13 (Darzins et al., J. Bacteriol. 159:9-18, 1984) can also be used.

Fusion protein may be cloned in a prokaryotic host such as E. coli or other bacterial species including, but not limited to, Escherichia, Pseudomonas, Salmonella, Serratia, and Proteus. Eukaryotic hosts also can be used for cloning and expression of wild type or mutant polymerases. Such hosts include yeast, fungi, insect and mammalian cells. Expression of the desired DNA polymerase in such eukaryotic cells may involve the use of eukaryotic regulatory regions which include eukaryotic promoters. Cloning and expressing the fusion proteins in eukaryotic cells may be accomplished by well known techniques using well known eukaryotic vector systems.

Hosts can be transformed by routine, well-known techniques. In one embodiment, transformed colonies are plated and screened for the expression of a fusion protein by transferring transformed E. coli colonies to nitrocellulose membranes. After the transformed cells are grown on nitrocellulose, the cells are lysed by standard techniques, and the membranes are then treated at 95° C. for 5 minutes to inactivate the endogenous E. coli enzyme. Other temperatures may be used to inactivate the host polymerases depending on the host used and the temperature stability of the fusion protein to be cloned. Fusion protein activity is then detected by assaying for the presence of DNA polymerase activity using well known techniques (i.e. Sanger et al., Gene 97:119-123, 1991).

Also described herein are host cells that contain or comprise nucleic acid molecules, and vectors that contain or comprise these nucleic acid molecules. Also included are methods for making the polypeptides (e.g., methods for producing polypeptides using these nucleic acid molecules and host cells). In particular embodiments, polypeptides are provided in (1) isolated, (2) substantially pure, and/or (3) essentially pure forms. Other aspects include compositions and mixtures (e.g., reaction mixtures) that contain or comprise one or more polypeptides and/or more polynucleotides described herein.

To optimize expression of the fusion proteins, inducible or constitutive promoters are well known and may be used to express high levels of a fusion protein in a recombinant host. Similarly, high copy number vectors, well known in the art, may be used to achieve or enhance expression of the fusion protein in a recombinant host.

To express the desired fusion protein in a prokaryotic cell (such as, E. coli, B. subtilis, Pseudomonas, etc.), the gene encoding the fusion protein may be operably linked to a functional prokaryotic promoter. However, the natural promoter may function in prokaryotic hosts allowing expression of the fusion protein. Thus, the natural promoter or other promoters may be used to express the fusion protein. Such other promoters may be used to enhance expression and may either be constitutive or regulatable (i.e., inducible or derepressible) promoters. Examples of constitutive promoters include the int promoter of bacteriophage λ, and the bla promoter of the β-lactamase gene of pBR322. Examples of inducible prokaryotic promoters include the major right and left promoters of bacteriophage λ, (PR and PL), trp, recA, lacZ, lacI, tet, gal, trc, and tac promoters of E. coli. The B. subtilis promoters include α-amylase (Ulmanen et al., J. Bacteriol 162:176-182 (1985)) and Bacillus bacteriophage promoters (Gryczan, T., supra.). Streptomyces promoters are described by Ward et al., Mol. Gen. Genet. 203:468-478, 1986). Prokaryotic promoters are also reviewed by Glick, J. Ind. Microbiol. 1:277-282, 1987; Cenatiempto, Y., Biochimie 68:505-516, 1986; and Gottesman, Ann. Rev. Genet. 18:415-442 (1984). Expression in a prokaryotic cell also requires the presence of a ribosomal binding site upstream of the gene-encoding sequence. Such ribosomal binding sites are disclosed, for example, by Gold et al., Ann. Rev. Microbiol. 35:365-404 (1981).

In one embodiment, the fusion proteins described herein are produced by fermentation of the recombinant host containing and expressing the cloned fusion protein gene. Any nutrient that can be assimilated by the thermophile of interest, or a host containing the cloned fusion protein gene, may be added to the culture medium. Optimal culture conditions should be selected case by case according to the strain used and the composition of the culture medium. Antibiotics may also be added to the growth media to insure maintenance of vector DNA containing the desired gene to be expressed.

Recombinant host cells producing the fusion proteins of the invention can be separated from liquid culture, for example, by centrifugation. In general, the collected microbial cells are dispersed in a suitable buffer, and then broken down by ultrasonic treatment or by other well known procedures to allow extraction of the enzymes by the buffer solution. After removal of cell debris by ultracentrifugation or centrifugation, the fusion protein can be purified by standard protein purification techniques such as extraction, precipitation, chromatography, affinity chromatography, electrophoresis or the like. Assays to detect the presence of the fusion proteins during purification are well known in the art and can be used during conventional biochemical purification methods to determine the presence of these enzymes.

Use of Fusion Proteins

The fusion proteins described herein may be used in well known DNA sequencing, DNA labeling, DNA amplification or cDNA synthesis reactions. The fusion proteins may also be used to analyze and/or type polymorphic DNA Fragments

Nucleic Acid Synthesis

SSB-DNAP fusion proteins may be used in nucleic acid synthesis reactions which comprise: (a) mixing one or more templates with one or more fusion proteins to form a mixture; and (b) incubating the mixture under conditions sufficient to make a nucleic acid complementary to all or a portion of the templates (i.e., a primer extension product). Reaction conditions sufficient to allow nucleic acid synthesis (e.g., pH, temperature, ionic strength, and incubation time) can be optimized according to routine methods known to those skilled in the art and may involve the use of one or more primers, one or more nucleotides, and/or one or more buffers or buffering salts, or any combination thereof.

SSB-DNAP fusion proteins may be used in amplification methods comprising: (a) mixing one or more templates with one or more fusion proteins to form a mixture; and (b) incubating the mixture under conditions sufficient to amplify a nucleic acid complementary to all or a portion of the templates. Such conditions may involve the use of one or more primers, one or more nucleotides, one or more buffers and/or one or more buffering salts, or any combination thereof. Conditions to facilitate nucleic acid synthesis such as pH, ionic strength, temperature and incubation time can be determined as a matter of routine by those skilled in the art.

Following nucleic acid synthesis, nucleic acids can be isolated for further use or characterization. Synthesized nucleic acids can be separated from other nucleic acids and other constituents present in a nucleic acid synthesis reaction by any means known in the art, including gel electrophoresis, capillary electrophoresis, chromatography (e.g., size, affinity and immunochromatography), density gradient centrifugation, and immunoadsorption. Separating nucleic acids by gel electrophoresis provides a rapid and reproducible means of separating nucleic acids, and permits direct, simultaneous comparison of nucleic acids present in the same or different samples. Nucleic acids made by the provided methods can be isolated using routine methods. For example, nucleic acids can be removed from an electrophoresis gel by electroelution or physical excision. Isolated nucleic acids can be inserted into vectors, including expression vectors, suitable for transfecting or transforming prokaryotic or eukaryotic cells.

DNA Sequencing

SSB-DNAP fusion proteins can be used in sequencing reactions (isothermal DNA sequencing and cycle sequencing of DNA). For example, SSB-DNAP fusion proteins can be used for dideoxy-mediated sequencing involves the use of a chain-termination technique which uses a specific polymer for extension by DNA polymerase, a base-specific chain terminator and the use of polyacrylamide gels to separate the newly synthesized chain-terminated DNA molecules by size so that at least a part of the nucleotide sequence of the original DNA molecule can be determined Specifically, a DNA molecule is sequenced by using four separate DNA sequence reactions, each of which contains different base-specific terminators. For example, the first reaction will contain a G-specific terminator, the second reaction will contain a T-specific terminator, the third reaction will contain an A-specific terminator, and a fourth reaction may contain a C-specific terminator. Preferred terminator nucleotides include dideoxyribonucleoside triphosphates (ddNTPs) such as ddATP, ddTTP, ddGTP, ddITP and ddCTP. Analogs of dideoxyribonucleoside triphosphates may also be used and are well known in the art. Detectably labeled nucleotides are typically included in sequencing reactions. Any number of labeled nucleotides can be used in sequencing (or labeling) reactions, including, but not limited to, radioactive isotopes, fluorescent labels, chemiluminescent labels, bioluminescent labels, and enzyme labels.

The fusion proteins may also be used in cycle sequencing reactions. Cycle sequencing often involves the use of fluorescent dyes. In some cycle sequencing protocols, sequencing primers are labeled with fluorescent dye (e.g., using Amersham Bioscience MegaBACE DYEnamic ET Primers, ABI Prism® BigDye™ primer cycle sequencing kit, and Beckman Coulter WellRED fluorescence dye). Sequencing reactions using fluorescent primers offers advantages in accuracy and readable sequence length. However, separate reactions must be prepared for each nucleotide base for which sequence position is to be determined In other cycle sequencing protocols, fluorescent dye is linked to ddNTP as a dye terminator (e.g., using Amersham Bioscience MegaBACE DYEnamic ET Terminator cycle sequencing kit, ABI Prism® BigDye™ Terminator cycle sequencing kit, ABI Prism® dRhodamine Terminator cycle sequencing kit, LI-COR IRDye™ Terminator Mix, and CEQ Dye Terminator Cycle sequencing kit with Beckman Coulter WellRED dyes). Since dye terminators can be labeled with unique fluorescence dye for each base, sequencing can be done in a single reaction.

Thus, nucleic acids may be sequenced by: (a) mixing one or more templates to be sequenced with one or more fusion proteins (and optionally one or more nucleic acid synthesis terminating agents such as ddNTPs) to form a mixture; (b) incubating the mixture under conditions sufficient to synthesize a population of molecules complementary to all or a portion of the template to be sequenced; and (c) separating the population to determine the nucleotide sequence of all or a portion of the template to be sequenced.

Polymerase Chain Reaction (PCR)

Polymerase chain reaction (PCR), a well known DNA amplification technique, is a process by which DNA polymerase and deoxyribonucleoside triphosphates are used to amplify a target DNA template. In such PCR reactions, two primers, one complementary to the 3′ termini (or near the 3′-termini) of the first strand of the DNA molecule to be amplified, and a second primer complementary to the 3′ termini (or near the 3′-termini) of the second strand of the DNA molecule to be amplified, are hybridized to their respective DNA strands. After hybridization, DNA polymerase, in the presence of deoxyribonucleoside triphosphates, allows the synthesis of a third DNA molecule complementary to the first strand and a fourth DNA molecule complementary to the second strand of the DNA molecule to be amplified. This synthesis results in two double stranded DNA molecules. Such double stranded DNA molecules may then be used as DNA templates for synthesis of additional DNA molecules by providing a DNA polymerase, primers, and deoxyribonucleoside triphosphates. As is well known, the additional synthesis is carried out by “cycling” the original reaction (with excess primers and deoxyribonucleoside triphosphates) allowing multiple denaturing and synthesis steps. Typically, denaturing of double stranded DNA molecules to form single stranded DNA templates is accomplished by high temperatures. The fusion proteins described herein include those which are heat stable, and thus will survive such thermal cycling during DNA amplification reactions. Thus, these fusion proteins are ideally suited for PCR reactions, particularly where high temperatures are used to denature the DNA molecules during amplification. The fusion proteins may be used in all PCR methods known to one of ordinary skill in the art, including end-point PCR, real-time qPCR (U.S. Pat. Nos. 6,569,627; 5,994,056; 5,210,015; 5,487,972; 5,804,375; 5,994,076, the contents of which are incorporated by reference in their entirety), allele specific amplification, linear PCR, one step reverse transcriptase (RT)-PCR, two step RT-PCR, mutagenic PCR, multiplex PCR and the PCR methods described in copending U.S. patent application Ser. No. 09/599,594, the contents of which are incorporated by reference in their entirety.

Preparation of cDNA

The fusion proteins (SSB-reverse transcriptase enzymes) described herein may also be used to prepare cDNA from mRNA templates. See, for example, U.S. Pat. Nos. 5,405,776 and 5,244,797, the disclosures of which are incorporated herein by reference. Thus, the invention also relates to a method of preparing cDNA from mRNA, comprising (a) contacting mRNA with an oligo(dT) primer or other complementary primer to form a hybrid; and (b) contacting the hybrid formed in step (a) with a fusion protein of the invention and the four dNTPs, whereby a cDNA-RNA hybrid is obtained. If the reaction mixture is step (b) further comprises an appropriate oligonucleotide which is complementary to the cDNA being produced, it is also possible to obtain dsDNA following first strand synthesis. Thus, the invention is also directed to a method of preparing dsDNA with the fusion proteins described herein.

Another embodiment features compositions and reactions for nucleic acid synthesis, sequencing or amplification that include the fusion proteins of the invention. These mixtures include one or more fusion proteins, one or more dNTPs (dATP, dTTP, dGTP, dCTP), a nucleic acid template, an oligonucleotide primer, magnesium and buffer salts, and may also include other components (e.g., nonionic detergent). If sequencing reactions are performed, the reaction may also include one or more ddNTPs. The dNTPs or ddNTPs may be unlabeled or labeled with a fluorescent, chemiluminescent, bioluminescent, enzymatic or radioactive label. In some embodiments, compositions comprising one or more fusion proteins are formulated as described in PCT WO98/06736, the entire contents of which are incorporated herein by reference.

In some embodiments, kits are provided (e.g., for use in carrying out the methods described herein). Such kits may include, in addition to one or more fusion proteins, one or more components selected from the group consisting of: one or more host cells (preferably competent to take up nucleic acid molecules), one or more nucleic acids (e.g., nucleic acid templates), one or more nucleotides, one or more nucleic acid primers, one or more vectors, one or more ligases, one or more topoisomerases, and one or more buffers or buffer salts.

Analyzing and Typing Polymorphic DNA Fragments

In one embodiment, the relationship between a first individual and a second individual may be determined by analyzing and typing a particular polymorphic DNA fragment, such as a minisatellite or microsatellite DNA sequence. In such a method, the amplified fragments for each individual are compared to determine similarities or dissimilarities. Such an analysis is accomplished, for example, by comparing the size of the amplified fragments from each individual, or by comparing the sequence of the amplified fragments from each individual. In another aspect of the invention, genetic identity can be determined. Such identity testing is important, for example, in paternity testing, forensic analysis, etc. In this aspect of the invention, a sample containing DNA is analyzed and compared to a sample from one or more individuals. In one such aspect of the invention, one sample of DNA may be derived from a first individual and another sample may be derived from a second individual whose relationship to the first individual is unknown; comparison of these samples from the first and second individuals by the methods of the invention may then facilitate a determination of the genetic identity or relationship between the first and second a individual. In a particularly preferred such aspect, the first DNA sample may be a known sample derived from a known individual and the second DNA sample may be an unknown sample derived, for example, from crime scene material. In an additional aspect of the invention, one sample of DNA may be derived from a first individual and another sample may be derived from a second individual who is related to the first individual; comparison of these samples from the first and second individuals by the methods of the invention may then facilitate a determination of the genetic kinship of the first and second individuals by allowing examination of the Mendelian inheritance, for example, of a polymorphic, minisatellite, microsatellite or STR DNA fragment.

In another aspect of the invention, DNA fragments important as genetic markers for encoding a gene of interest can be identified and isolated. For example, by comparing samples from different sources, DNA fragments which may be important in causing diseases such as infectious diseases (of bacterial, fungal, parasitic or viral etiology), cancers or genetic diseases, can be identified and characterized. In this aspect of the invention a DNA sample from normal cells or tissue is compared to a DNA sample from diseased cells or tissue. Upon comparison according to the invention, one or more unique polymorphic fragments present in one DNA sample and not present in the other DNA sample can be identified and isolated. Identification of such unique polymorphic fragments allows for identification of sequences associated with, or involved in, causing the diseased state.

Gel electrophoresis is typically performed on agarose or polyacrylamide sequencing gels according to standard protocols using gels containing polyacrylamide at concentrations of 3-12% (e.g., 8%), and containing urea at a concentration of about 4-12M (e.g., 8M). Samples are loaded onto the gels, usually with samples containing amplified DNA fragments prepared from different sources of genomic DNA being loaded into adjacent lanes of the gel to facilitate subsequent comparison. Reference markers of known sizes may be used to facilitate the comparison of samples. Following electrophoretic separation, DNA fragments may be visualized and identified by a variety of techniques that are routine to those of ordinary skill in the art, such as autoradiography. One can then examine the autoradiographic films either for differences in polymorphic fragment patterns (“typing”) or for the presence of one or more unique bands in one lane of the gel (“identifying”); the presence of a band in one lane (corresponding to a single sample, cell or tissue type) that is not observed in other lanes indicates that the DNA fragment comprising that unique band is source-specific and thus a potential polymorphic DNA fragment.

Nucleic Acid Synthesis Compositions

Nucleic acid synthesis compositions can include one or more SSB-DNA polymerase or SSB-reverse transcriptase fusion proteins, one or more nucleotides, one or more primers, one or more buffers and/or one or more templates. In some embodiments, a nucleic acid synthesis reaction can include mRNA and a fusion protein having reverse transcriptase activity. These compositions can be used to improve the yield and/or homogeneity of primer extension products made during nucleic acid synthesis (e.g., cDNA synthesis, amplification and combined cDNA synthesis/amplification reactions).

Kits

The fusion proteins described herein are suited for the preparation of a kit. Kits comprising these fusion proteins may be used for detectably labeling DNA molecules, DNA sequencing, amplifying DNA molecules or cDNA synthesis by well known techniques, depending on the content of the kit. See U.S. Pat. Nos. 4,962,020, 5,173,411, 4,795,699, 5,498,523, 5,405,776 and 5,244,797, the disclosures of which are hereby incorporated by reference. Such kits may comprise a carrying means being compartmentalized to receive in close confinement one or more container means such as vials, test tubes and the like. Each of such container means comprises components or a mixture of components needed to perform DNA sequencing, DNA labeling, DNA amplification, or cDNA synthesis.

Such kits may include, in addition to one or more fusion proteins, one or more components selected from the group consisting of one or more host cells (preferably competent to take up nucleic acid molecules), one or more nucleic acids (e.g., nucleic acid templates), one or more nucleotides, one or more nucleic acid primers, one or more vectors, one or more ligases, one or more topoisomerases, and one or more buffers or buffer salts.

Kit constituents typically are provided, individually or collectively, in containers (e.g., vials, tubes, ampules, and bottles). Kits typically include packaging material, including instructions describing how the kit can be used for example to synthesize, amplify or sequence nucleic acids. A first container may, for example, comprise a substantially purified sample of each fusion protein. A second container may comprise one or a number of types of nucleotides needed to synthesize a DNA molecule complementary to DNA template. A third container may comprise one or a number of different types of dideoxynucleoside triphosphates. A fourth container may comprise pyrophosphatase. In addition to the above containers, additional containers may be included in the kit which comprise one or a number of DNA primers. A kit used for amplifying DNA will comprise, for example, a first container comprising a substantially pure fusion protein as described herein and one or a number of additional containers which comprise a single type of nucleotide or mixtures of nucleotides. Various primers may or may not be included in a kit for amplifying DNA. The various kit components need not be provided in separate containers, but may also be provided in various combinations in the same container. For example, the fusion protein and nucleotides may be provided in the same container, or the fusion protein and nucleotides may be provided in different containers.

Kits for cDNA synthesis comprise a first container containing a fusion protein, a second container containing the four dNTPs and the third container containing an oligo(dT) primer. See U.S. Pat. Nos. 5,405,776 and 5,244,797, the disclosures of which are incorporated herein by reference. Since the fusion proteins of the present invention are also capable of preparing dsDNA, a fourth container may contain an appropriate primer complementary to the first strand cDNA. Of course, it is also possible to combine one or more of these reagents in a single tube. When desired, the kit of the present invention may also include a container which comprises detectably labeled nucleotides which may be used during the synthesis or sequencing of a DNA molecule. One of a number of labels may be used to detect such nucleotides. Illustrative labels include, but are not limited to, radioactive isotopes, fluorescent labels, chemiluminescent labels, bioluminescent labels and enzyme labels.

Having now generally described the embodiments, the same will be more readily understood through reference to the following examples which are provided by way of illustration, and are not intending to be limiting of the present invention.

EXAMPLES

The invention is further described in the following examples, which do not limit the scope of the inventions described in the claims.

Example 1 Cloning of Sso SSB-Tzi Polymerase Fusion Protein

The Sso SSB gene was cloned upstream and in-frame with the T. zilligii (Tzi) DNA polymerase gene in pET26b (Novagen). There is a 6-mer linker (GSSGVD) (amino acids 151-156 of SEQ ID NO: 2) between the Sso SSB region and the Tzi polymerase region. The DNA sequence of the open reading frame (ORF) for Sso SSB-Tzi is shown in SEQ ID NO: 1, and the corresponding amino acid sequence is shown in SEQ ID NO: 2. The fusion protein consists of an Sso portion (amino acids 1-150), a 6-mer amino acid linker (amino acids 151-156) and a Tzi portion (amino acids 157-941).

SEQ ID NO: 1: 5′-atggaagaaaaagtaggtaatctgaaaccaaatatggaaagcgtaaatgtaaccgtacgagttttggaagc aagcgaagcacgtcaaatccagacaaagaacggtgttcggacaatcagtgaggctattgttggagatgaaacg ggacgagtaaagttaacattatggggaaaacatgcaggtagtatcaaagaaggtcaagtggtaaagattgaaaa cgcgtggaccaccgcttttaagggtcaagtacagttaaatgctggaagcaaaactaagatcgctgaagcttcag aagatggatttccagaatcatctcaaattccagaaaatacaccaacagctcctcagcaaatgcgtggaggagga cgcggattccgcggtgggggacgtcggtatggacgacgtggtggtcgccggcaagaaaacgaagaaggtga agaggagggaagcggaggggtcgacatgatcctcgatgctgactacatcaccgaagacggaaagcccgtca taagggtcttcaagaaggaaaagggcgagtttaagatagactacgacagggactttgagccctacatctacgcc ctcctgaaggacgattccgccattgaggacatcaagaagatcaccgccgagaggcacggcaccaccgttaga gttacccgggcggagagggtgaagaagaagttcctcggcaggccggtggaggtctggaagctctacttcacc cacccccaggacgttcccgcgatcagggacaaaatcagggagcatccggcggttgttgacatctacgagtacg acatacccttcgcgaagcgctacctcatagacaggggcttaatccctatggagggggacgaggagctcaggat gctcgccttcgacatcgagacgctctaccatgagggggaggagtttggcgaggggcctatcctgatgataagct acgccgatgaagagggggcgcgcgttatcacctggaagaatatcgacctcccctacgtggagagcgtttctact gagaaagagatgataaagcgcttcctcaaggtaatccaggagaaggatccggatgtgctcataacctacaacg gcgacaacttcgactttgcttacctcaagaagcgctcagaaacgctcggcgtcaagttcatcctcggaagggac gggagcgaaccgaaaattcagcgcatgggcgaccgctttgcagtggaggtgaaggggagaatacacttcgac ctctacccggttataaggaggactattaacctccccacctacaccctcgagacagtctacgaggcgattttcggg caaccaaaggagaaggtctacgcggaagagatagcgcgggcctgggagagcggggaaggcttggaaagg gtggcccgctattccatggaggacgcaaaggcaacttacgaactcggaaaagagttcttcccgatggaggccc agctctcgcgcctcgtgggccagagcctctgggatgtatcgcgctcgagcacaggaaacttagttgagtggttt ctcctgaggaaggcctacgagaggaacgagctcgcgccaaacaagccggacgagagggagttagcaagga gagcggagagctacgcgggtggatatgtcaaagagcccgaaaaggggctgtgggagaacatagtctacctc gattacaaatctctctacccctcgataatcatcacccacaacgtctcccctgataccctcaacagggagggctgta gggagtacgacgtggcacctcaggtgggacaccgcttctgcaaggacttcccgggctttatcccgagcctcctc ggggaccttttggaggagaggcagaaggtaaagaagaaaatgaaggccacggtggacccgatagagagga agctcctcgactacaggcaacgcgccatcaagattctggccaacagttattacggctactacggctacgcaaat gcccgctggtactgcagggagtgcgccgagagcgttaccgcctggggcaggcagtatattgaaaccacgatg agggaaatagaggagaaatttggctttaaagtgctttacgcggataccgacggtttctttgccacgattcccgga gcggacgccgaaacggtcaaaaagaaggctaaagaattcctgaactacatcaaccccagactgcccggcctg ctcgagctggagtacgagggcttctacaggcgcggcttcttcgtgacgaagaagaagtacgcggttatagacg aggaggacaagataacgacgcgcgggctggaaatagtaaggcgcgactggagcgagatagcgaaggaga cgcaggcgagggttcttgaggcgatactcaagcacggtgacgtcgaagaggcagtaaggattgtcaaggagg tgacggaaaagctgagtaggtacgaggttccaccggagaagctcgtcatctacgagcagataacccgcgacct gagggactacagggccacggggccgcacgtggccgttgcaaaacgcctcgccgcgagggggataaaaatc cggcccgggacggtcataagctacatagtgctcaaaggcccgggaagggttggggacagggcgatacccttc gacgagttcgaccctgcaaagcaccgctatgatgcggaatactacatcgagaaccaggttcttccagcggtgg agaggattctgagggcctttggttaccgcaaagaggacttgaggtatcagaagacgaagcaggccggactgg gggcgtggctaaaaccgaagacataa-3′ SEQ ID NO: 2:   1 MMEEKVGNLK PNMESVNVTV RVLEASEARQ IQTKNGVRTI SEAIVGDETG RVKLTLWGKH                           Sso ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  61 AGSIKEGQVV KIENAWTTAF KGQVQLNAGS KTKIAEASED GFPESSQIPE NTPTAPQQMR              Sso                                   Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~               ~~~~~~~~~~~~ 121 GGGRGFRGGG RRYGRRGGRR QENEEGEEEG    GSGGVD     MILDADYITED                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 GKPVIRVFKK ENGEEKIDYD RDFEPYIYAL LKDDSAIEDI KKITAERHGT TVRVTRAERV                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 KKKFLGRPVE VWKLYFTHPQ DVPAIRDKIR EHPAVVDIYE YDIPFAKRYL IDRGLIPMEG                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 288 DEELRMLAFD IETLYHEGEE FGEGPILMIS YADEEGARVI TWKNIDLPYV ESVSTEKEMI                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 348 RFLKVIQEK DPDVLITYNG DNFDFAYLKK RSETLGVKFI LGRDGSEPKI QRMGDRFAVE                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 408 KGRIHFDLY PVIRRTINLP TYTLETVYEA IFGQPKEKVY AEEIARAWES GEGLERVARY                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 SMEDAKATYE LGKEFFPMEA QLSRLVGQSL WDVSRSSTGN LVEWFLLRKA YERNELAPNK                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 PDERELARRA ESYAGGYVKE PEKGLWENIV YLDYKSLYPS IIITHNVSPD TLNREGCREY                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 588 DVAPQVGHRF CKDFPGFIPS LLGDLLEERQ KVKKKMKATV DPIERKLLDY RQRAIKILAN                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 648 SYYGYYGYAN ARWYCRECAE SVTAWGRQYI ETTMREIEEK FGFKVLYADT DGFFATIPGA                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 708 DAETVKKKAK EFLNYINPRL PGLLELEYEG FYRRGFFVTK KKYAVIDEED KITTRGLEIV                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 768 RRDWSEIAKE TQARVLEAIL KHGDVEEAVR IVKEVTEKLS RYEVPPEKLV IYEQITRDLR                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 828 DYRATGPHVA VAKRLAARGI KIRPGTVISY IVLKGPGRVG DRAIPFDEFD PAKHRYDAEY                           Tzi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 888 YIENQVLPAV ERILRAFGYR KEDLRYQKTK QAGLGAWLKP KT

Example 2 Purification of Sso SSB-Tzi Polymerase Fusion Protein

pTTQ-SsoTzi fusion protein was expressed in E. coli BL21/DE3 cells containing a plasmid expressing two supplemental tRNAs: ArgU (arginine) and IleY (isoleucine). A 150 mL culture of these cells was grown in LB medium supplemented with 0.18% dextrose, 40 ug/mL chloramphenicol and 100 ug/mL ampicillin at 30° C., overnight. This culture was then diluted into six liters of LB medium and the cells were grown at 37° C. to an OD600 of ˜0.8-1.0 and then induced with 1 mM IPTG. The cells continued to grow for 3 hours post-induction and were then harvested by centrifugation at 6,000×g for 20 minutes.

After centrifugation, the cell pellet was resuspended in 3 ml of lysis buffer (50 mM Tris-HCl, pH 7.5, 1 mM EDTA, 8% glycerol, 10 mM KCl, 5 mM b-mercaptoethanol, 50 mg/ml phenylmethylsulfonyl fluoride) per gram of wet cell paste and lysed by sonication (70-80% lysis based on OD600). The lysate was then heat-treated for 30 minutes at 85° C., then immediately placed on ice and sodium chloride (NaCl) was added to a final concentration of 250 mM. Polyethylenimine (PEI; 2% v/v) was added dropwise to the lysate at 4° C. to a final concentration of 0.15% (v/v) and allowed to mix for 30 minutes at 4° C. The lysate was centrifuged for 1 hour in an SS-34 rotor at 17,500 rpm, and the supernatant was retained. A solid ammonium sulfate cut was performed on the supernatant to about 55% saturation while mixing at 4° C. The lysate was centrifuged for 30 min in a SS-34 rotor at 13,000 rpm, and the pellet was resuspended in low salt buffer (30 mM Tris-HCl, pH 7.5, 1 mM EDTA, 1 mM DTT, 10% glycerol, 50 mM NaCl) and dialyzed against low salt buffer overnight.

The solution was applied to a 5 ml EMD-SO4 column (1.6×5 cm) (EM Science) and equilibrated with the low salt buffer. The column was washed with 4 column volumes (cV) of low salt buffer and the protein was eluted with a 15 cV gradient from low salt buffer to 70% of high salt buffer (30 mM Tris HCl, pH 7.5, 1 mM EDTA, 1 mM DTT, 10% glycerol, 1,000 mM NaCl), followed by three cV wash at 70% high salt buffer. Two ml fractions were collected and analyzed by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) on 4-20% Tris-glycine gels (Novex) stained with Novex SimplySafe stain according to the manufacturer's instructions. Fractions containing the desired protein band were further analyzed by the Polymerase unit activity assay as follows.

DNA polymerase activity of the Sso SSB-Tzi polymerase fusion protein was assessed by the incorporation rate of radiolabeled nucleotides into a nicked salmon testes DNA template. In this assay, one polymerase unit corresponds to the incorporation of 10 nmol of deoxynucleotides into acid-precipitable material in 30 min. at 74° C. under standard buffer conditions. The nucleotide incorporation into acid-insoluble fractions was measured by spotting an aliquot of the reaction onto a GF/C filter, washing the filter with trichloroacetic acid (TCA) solution, and counting the amount of radioactivity on the filter using a scintillation counter.

For a standard unit assay, 5 μl of a dilution of Sso SSB-Tzi polymerase fusion protein was added to a set of 50 μl reactions. Each reaction contained 0.5 μg/μl of nicked salmon testes DNA and 0.2 mM of each dNTP (dATP, dCTP, dGTP, dTTP) in 1× unit assay buffer (25 mM TAPS, pH 9.3, 50 mM KCl, 2 mM MgCl₂, 1 mM DTT and 1 to 2 μCi [α-³²P] dCTP in a final volume of 50 μl per reaction.

The reaction was initiated upon addition of the fusion protein and transfer to a heating block equilibrated to 74° C. The reaction was continued for 10 min and terminated by adding 10 μl of 0.5 M EDTA to each of the 50 l reactions on ice. 40 μl each of the mixtures was spotted onto a GF/C filter for TCA precipitation. Reactions included a non-saturating amount of polymerase (i.e., in the range where activity linearly corresponds to the dilution factor).

TCA precipitation was performed as follows. The filters were washed in 10% TCA solution containing 1% sodium pyrophosphate for 15 min, in 5% TCA for 10 min three times, then in 95% ethanol for 10 min. The filters were dried under a heat lamp for 5 to 10 min and the radioactivity decay rate was measured in ScintiSafe Econo 1 scintillation cocktail (Fisher Scientific, part # SX20-5) using a Beckman scintillation counter (Model # LS 3801).

Fractions containing optimal polymerase activity were pooled and dialyzed against 2 liters of Resource Q low salt buffer (25 mM Tris-HCl, pH 8, 1 mM EDTA, 1 mM DTT, 10% glycerol, 50 mM NaCl).

The sample was applied to an 8 ml Source Q column (GE Healthcare, Amersham) equilibrated with Source Q low salt buffer (25 mM Tris-HCl, pH 8, 50 mM NaCl, 1 mM EDTA, 10% glycerol). The column was washed with 5 cV of low salt buffer and eluted with 20 cV of a linear gradient from low salt buffer to 50% of high salt buffer (25 mM Tris-HCl, pH 8, 1 mM EDTA, 1 mM DTT, 10% glycerol, 1,000 mM NaCl), followed by an additional three cV wash at 50% of high salt buffer. Two ml fractions were collected and analyzed by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) on 4-20% Tris-glycine gels (Novex) stained with Novex SimplySafe stain according to the manufacturer's instructions. Fractions containing the desired protein band were further analyzed by the polymerase unit activity assay. Fractions containing optimal polymerase activity were pooled and dialyzed against 2 liters of storage buffer (20 mM Tris-HCl, pH 8, 40 mM KCl, 0.1 mM EDTA, 1 mM DTT, 50% glycerol). The protein concentration was measured by UV absorbance as described below prior to addition of BigChap and Chapso since these detergents interfere with UV absorbance measurements. BigChap and Chapso were added to final concentrations of 1.2% and 0.12%, respectively. The resulting Sso-Tzi fusion protein was greater than 95% pure.

Example 3 UV Absorbance for Quantitation of Sso SSB-Tzi Polymerase Fusion Protein

The protein concentration determination for Sso SSB-Tzi polymerase fusion protein was performed by UV absorbance at 278 nm. The UV spectrum was measured using a Beckman Model DU-640 spectrophotometer in a Beckman micro quartz cell (8 mm) from 220 to 320 nm. The UV measurement was done in the absence of the detergents, BigChap and Chapso since these interfere with the measurements (see above). Absorbance at 320, 315, 278 and 260 nm were read from the spectrum. The absorbance at 320 and 315 nm were used to calculate slope of the baseline, while the absorbance at 278 and 260 nm were used to estimate the extent of nucleic acid contamination. Absorbance at 278 nm was calibrated by subtracting baseline, calculated from the slope of the baseline, using the equation: Abs(278)_(cal)=Abs(278)_(obs)−8.4×(Abs(315)_(obs)−Abs(320)_(obs))

where Abs(278)_(cal) is calibrated absorbance at 278 nm, and Abs(278)_(obs), Abs(310)_(obs) and Abs(320)_(obs) are measured absorbance at 278, 310 and 320 nm, respectively. If any of the values were higher than 2 OD, the protein solution was diluted with the storage buffer until all the values were below 2 OD. One mg/ml solution would have Abs(278)_(cal) at 1.27 (extinction coefficient). Therefore the concentration of the protein in a prep can be determined as: enzyme concentration(mg/ml)=Abs(278)_(cal)/1.27

DNA contamination can be detected by UV absorbance as well. DNA, especially single-stranded DNA, has a high extinction coefficient meaning a very small amount of DNA can register a high OD. Similarly to protein determination, the absorbance at 260 nm was calculated as below: Abs(260)_(cal)=Abs(260)_(obs)−12×(Abs(315)_(obs)−Abs(320)_(obs))

where Abs(260)_(cal) is calibrated absorbance at 260 nm, and Abs(260)_(obs), Abs(310)_(obs) and Abs(320)_(obs) are measured absorbance at 260, 310 and 320 nm, respectively. The higher the ratio of Abs(260)_(cal) to Abs(278)_(cal), the greater the amount of DNA contamination. An Abs(278)_(cal)/Abs(260)_(cal) ratio of about 2±10% deviation (1.8 to 2.2) is an acceptable level of DNA contamination in the enzyme prep. The Sso SSB-Tzi polymerase fusion proteins produced by the method described above had an Abs(278)_(cal)/Abs(260)_(cal) ratio in the acceptable range described above. The purified Sso-Tzi polymerase fusion protein preparations were also evaluated for the presence of nuclease contamination. Preparations of Sso-Tzi polymerases were found to contain no detectable endonuclease activity and contained little, if any, 5′-exonuclease contamination. 5′ exonuclease activity was measured using a DNA substrate radiolabeled at its 5′ end by a kinase. The release of the radiolabel was monitored by thin layer chromatography (TLC) after incubation for 1 h at 74° C.

In the double-stranded endonuclease assay, increasing amounts of Sso SSB-Tzi polymerase fusion protein in 1×Tzi buffer with an increased magnesium concentration (45 mM Tris-HCl, pH 8.2, 20 mM KCl, 15 mM (NH₄)₂SO₄, 5 mM MgSO₄, 0.14 mg/ml BSA) were added to supercoiled ds φX174 DNA. The ratio of supercoiled circular φX174 DNA to relaxed circular DNA was measured by 1.2% agarose gel electrophoresis. Higher endonuclease activity results in conversion of supercoiled to relaxed circular DNA at an elevated rate. The relaxed circular DNA migrates at a reduced rate on the gel compared to the supercoiled circular DNA.

Example 4 PCR Conditions and Optimization for Sso SSB-Tzi Polymerase Fusion Protein

PCR reactions were prepared in 50 μl reaction volumes, unless indicated otherwise. The typical 50 μl reaction contained five units Tzi High Fidelity polymerase, 1×Tzi buffer (45 mM Tris-HCl, pH 8.2, 20 mM KCl, 15 mM (NH₄)₂SO₄, 1.2 mM MgSO₄, 0.14 mg/ml BSA), 0.3 μM of each primer, 0.3 mM dNTP mix, and template concentration that varied from 10-100 pg for plasmids, 10 ng to 1 μg cDNA, and 50-100 ng (genomic DNA). The PCR reactions were set-up at room temperature and run following a standard protocol, unless otherwise stated. Thermocycling was conducted using either the Perkin Elmer GeneAmp PCR System 9600 or the Perkin Elmer GeneAmp PCR System 2400.

Standard PCR Program:

94° C. 2 minutes 35 cycles of 94° C. 15 seconds 55° C.-65° C. 10 seconds 72° C. 30 to 60 seconds/kb Post PCR elongation 72° C. 10 minutes Hold at 14° C.

Following the completion of thermocycling, PCR amplification products were mixed with 5 μl of 10× BlueJuice and aliquots (10 μl of total reaction volume per each lane) were analyzed by electrophoresis through an 0.8% agarose gel in 0.5×TBE containing ethidium bromide at a concentration of 0.5 μg/ml. The resulting gels were analyzed visually for specificity and yield among different samples.

Example 5 DNA Polymerase Fidelity Assay

The DNA polymerase fidelity assay is based on streptomycin resistance (Lackovich et al., Focus 23:6-7 (2001); Fujii et al., J. Mol. Biol. 289:596-601 (1999)). Briefly, pMOL 21 plasmid DNA (4 kb), containing the ampicillin (Apr) and (rpsL) genes, was linearized with Sca I and standard PCR was performed on the linearized product using biotinylated primers. Amplification was done using 5 units of Sso SSB-Tzi polymerase fusion protein. Template DNA (10 ng) was subjected to 35 cycles of amplification. PCR cycling parameters were: 94° C. for 5 min, followed by 35 cycles of 94° C. for 15 s, 58° C. for 10 s, and 72° C. for 5 min, with a 10 minute final elongation at 72° C. The PCR product was streptavidin-magnetic-bead-purified and analyzed on an agarose gel to estimate DNA concentration and template doubling. The purified DNA was ligated with T4 DNA ligase and transformed into MF101 competent cells. Cells were plated on ampicillin plates to determine the total number of transformed cells. Cells were plated on media containing ampicillin and streptomycin to determine the total number of rpsL mutants (i.e., number of streptomycin mutants). Mutation frequency was determined by dividing the total number of rpsL mutants by the total number of transformed cells.

Shown in Table 2 are the relative fidelities of Taq, Tzi (wild type), SsoSSB-Tzi polymerase, and SsoSSB-Tzi polymerase containing and lacking a His6 tag at the C-terminus A relative fidelity of one corresponds to an error rate of 34.13±9.56×10⁻⁶. The data shown in Table 2 reflect the averages of at least three determinations.

TABLE 2 Fidelity of SsoSSB-Tzi polymerase fusion protein and wild type Tzi polymerase Relative Enzyme Fidelity Taq 1 Tzi (wild type) 15 SsoSSB-Tzi polymerase, 52 (His)6 SsoSSB-Tzi polymerase, 53 Tagless

Table 2 Example 6 PCR Amplification with SsoSSB-Tzi Polymerase

SsoSSB-Tzi polymerase was also tested using a range of genomic targets. The targets tested were A) Rhod 462 bp, b) p53 1494 bp, C) Rhod 2497 bp, D) Rhod 3123 bp, and E) Rhod 3871 bp. Standard PCR was used (Example 4) with 100 ng K562 cell DNA as a template. The results are shown in FIG. 1. Both Phusion and iProof had difficulty generating the smallest targets, while Phusion also was unable to amplify the longer 3871 bp fragment. Sso SSB-Tzi polymerase fusion protein had the greatest yields overall, with iProof performing slightly below Sso SSB-Tzi polymerase fusion protein for three of the five targets.

It will be readily apparent to one of ordinary skill in the relevant arts that other suitable modifications and adaptations to the methods and applications described herein may be made without departing from the scope of the invention or any embodiment thereof. It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the claims. Having now described the present invention in detail, the same will be more clearly understood by reference to the following examples, which are included herewith for purposes of illustration only and are not intended to be limiting of the invention. 

What is claimed is:
 1. An isolated or purified single stranded DNA binding protein (SSB)-nucleic acid polymerase fusion protein wherein said SSB comprises amino acids 1 to 150 of SEQ ID NO: 2, wherein said fusion protein comprises at least 50% polymerase activity, 3′-5′ exonuclease activity and/or 5′-3′ exonuclease activity of the fusion protein shown in SEQ ID NO:2, and wherein the presence of said SSB enhances fidelity of said fusion protein compared to an identical nucleic acid polymerase that is not joined to said SSB.
 2. The fusion protein of claim 1 wherein said nucleic acid polymerase is a DNA polymerase.
 3. The fusion protein of claim 1, wherein said nucleic acid polymerase is a reverse transcriptase.
 4. The fusion protein of claim 1, wherein said nucleic acid polymerase is a thermostable DNA polymerase.
 5. The fusion protein of claim 4, wherein said thermostable DNA polymerase is Thermococcus zilligi (Tzi) DNA polymerase.
 6. The fusion protein of claim 1, wherein said nucleic acid polymerase comprises amino acids 157 to 941 of SEQ ID NO:2.
 7. An isolated or purified Sulfolobus sofataricus (Sso) SSB-Thermococcus zilligi (Tzi) DNA polymerase fusion protein wherein said Sso SSB comprises amino acids 1 to 150 of SEQ ID NO:2 and has at least 50% polymerase activity, 3′-5′ exonuclease activity and/or 5′-3′ exonuclease activity of the fusion protein shown in SEQ ID NO:2.
 8. A fusion polymerase protein that comprises an amino acid sequence that has at least 75% sequence identity to SEQ ID NO:2 and has at least two joined heterologous domains, wherein said heterologous domains comprise: a. a single-stranded DNA binding protein (SSB) domain comprising amino acids 1 to 150 of SEQ ID NO:2; and b. a polymerase domain, and wherein the relative fidelity of said polymerase fusion protein is enhanced compared to the fidelity of the corresponding full length wild type polymerase protein lacking said SSB domain.
 9. The fusion protein of claim 1, wherein said isolated or purified single stranded DNA binding protein (SSB)-nucleic acid polymerase fusion protein comprises no detectable endonuclease activity.
 10. The fusion protein of claim 1, wherein said fidelity is enhanced by about 1.2 to 10,000 fold compared to the fidelity of an identical nucleic acid polymerase that is not joined to said SSB.
 11. The fusion protein of claim 1, wherein said fusion protein exhibits a reduced misincorporation rate that is less than 90% of the relative misincoporation rate of an identical nucleic acid polymerase that is not joined to said SSB.
 12. The fusion protein of claim 4, wherein said thermostable DNA polymerase retains at least 50% of its nucleic acid synthesis activity after bein heated in a nucleic acid synthesis mixture at 90° C. for 30 seconds.
 13. The fusion protein of claim 1, wherein said SSB is at the N-terminal and said nucleic acid polymerase is at the C-terminal end of the fusion protein.
 14. The fusion protein of claim 1, wherein said SSB is upstream (5′) from said nucleic acid polymerase.
 15. The fusion protein of claim 1, wherein said SSB and said nucleic acid polymerase are immediately adjacent to each other.
 16. The fusion protein of claim 1, wherein said SSB and said nucleic acid polymerase are separated by an amino acid linker.
 17. The fusion protein of claim 16, wherein said amino acid linker is between 1 and 100 amino acids in length. 